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Introduction 


This class is a CI-M class, so it will consist primarily of student presentations. 

Looking at the syllabus, most of this class is centered around presentation, writing, and participation. The schedule 
is already posted on the Stellar website, and there is a Google Sheet where we can sign up for presentation times. Our 
first talk is just for feedback — it won't be graded. 

We'll be starting with chapter 7 of Serre’s “A Course in Arithmetic,” and then we'll move on to Diamond and 


Shurman’s “A First Course in Modular Forms.” We can get both of these texts from the library. 


Fact 1 


Notes from my own presentations will be copied from my handouts, while notes from others’ are my transcriptions. 
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Here's a few points about how to give a good presentation: 


+ Read and understand (digest) the material we're being given, and then explain it! The rest of the class will only 


see this final “explain” part, but that doesn’t mean the first two parts aren’t important. 


* Have a good lecture plan. Decide what to include and what to exclude, keeping in mind that we're trying to help 


our classmates understand the material. Also, include good examples! 
- Be ready to respond to questions. We'll have lots of discussions, and we get to ask many questions. 


+ Speak clearly and loudly. 


Organize board space well (dividing the board into smaller boards, picking blackboard order, etc.) Make sure the 
lecture “flows” well on the board, so people can go back to previous theorem statements, definitions, and so on. 


(Write in complete sentences. ) 
« Label definitions, lemmas, and so on. 


+ Make handouts for lectures and distribute them beforehand. (This also helps with bad handwriting.) Handouts 


may or may not be graded, still unclear. 


Interact with the audience — face them from time to time when lecturing, make eye contact, and pause for 


questions. 


Professor Kim has office hours (see Stellar for timeslots each week), where she can help us with presentation plans 
and material. All of us will be given comment forms to fill out for each presenter — we should make sure our comments 
are constructive, because they will be given (anonymously) to the presenter. 

For our writing project, we can pick our own topic — Professor Kim is very open-minded. 

Our main goals in this class are to understand modular forms, elliptic curves, and L-functions. So today, we'll do 


a brief introduction for motivation and to tie together some of these topics. 


Example 2 


The Riemann zeta function 


is an example of an L-function. 


This function is well-defined for all Re(s) > 1, and it has a pole at s = 1. We can also write it in the Euler product 


form 


5) =T] 5. 


¢(s) also satisfies a functional equation, which gives a meromorphic continuation to all of C. The whole idea is that 


this can contain a lot of information! 


Definition 3 (Loose definition) 


In general, an L-function takes the form 
co ae 
ns! 


L(s) = 


n=1 


where a, € C and the sequence {an} contains some arithmetic information. 


Often, L-functions can be associated with an elliptic curve L(s,£) or a modular form L(s, f) — we'll talk about 


this more later. 


Example 4 (Dirichlet series) 


Let N € Zyo, and consider a character (basically a one-dimensional representation) from (Z/nZ)* > C. 


This can be lifted to x : Z > C by defining 


7 x(a mod N gcd(a,N) =1 
x(a) = : 
0 otherwise 


We can then define the L-function 


This actually gives us the following result from chapter 6 of Serre’s book: 


Theorem 5 (Dirichlet) 


If gcd(a, N) = 1, then there exist infinitely many primes p with p = a mod N. 


Definition 6 


An elliptic curve is a smooth projective algebraic curve of genus 1 with a distinguished point. 


Rational elliptic curves can be written in the form (after transformations) 
2_ \3 
yo =x-+axt+b,abeZ 


where the discriminant 4a? + 27b? is nonzero. This is an example of a Diophantine equation, which people have been 


interested in trying to solve for integer solutions: 


Example 7 


Are there three consecutive integers whose product is a square? 


3 


(This can be rewritten as finding integral solutions to y? = x? — x.) 


Example 8 
Similarly, what positive integers are congruent (which means they're a possible area of a right triangle with rational 


side lengths)? 


This can actually be rephrased in terms of elliptic curves as well, after many changes of variables: 


Proposition 9 


3 _ n?x has rational solutions with y 4 0. 


An integer n € N is congruent if and only if y* = x 


For example, for n = 1, we have the elliptic curve y? = x? — x, which only has the solutions (0,0), (1,0), (—1, 0). 
So 1 is not a congruent number. 
We can be more precise with this problem, though. Denote C, to be the set of triples (a, b, c) € Q® corresponding 


3 


to right triangles with area n, and let E, be the solutions (x, y) € Q? to the elliptic curve y? = x?— n?x. Then there's 


a direct bijection between the solutions: 


C—a'c—a 


nb 2n? x? —n? 2nx x2 + nr? 
(a,b,c) ==> ; : 
y y y 


; ) <= (x,y). 
Elliptic curves can also be used to prove Fermat's last theorem — specifically, the relevant objects are called Frey 


curves. For any odd prime p, the idea is to consider 
y* = x(x — a?)(x + b?) 


if a? + bP = cP. (Then the discriminant is divisible by a°, b®, c?.) Frey curves are not modular, but it turns out all 
rational elliptic curves are modular! So this is a contradiction, but it takes many pages to prove this. 

So how do we construct L-functions associated to an elliptic curve E? We start with our basic form °°, #2, and 
we specify our coordinates: a, for prime p is equal to p+ 1 minus the number of rational points of E over F,, and 
similarly apn is p” + 1 minus the number of rational points of E over Fp». And then we can define amy = am - an for 


relatively prime m,n, and this allows us to determine all coefficients. 


ap 


2b 


Remark 10. /f we're interested in statistics or probability, we can consider the distribution of the coefficients 
(which are contained in [-1, 1]). This is called the Sato-Tate distribution. 


Definition 11 
The upper-half plane is defined as H = {z € C[lm(z) > O}, and the special linear group SL2(R) is defined as 


a 
the set of matrices 


| with real coefficients and determinant 1. 
Cc 


az+b 
cz+d° 


SL»(IR) acts on the upper half-plane by taking a complex number z and sending it to gz = 


Definition 12 
A function f : H — C is a modular function of weight k € 2N if 


fgz)— (ez d) Fiz). 


If f is also holomorphic on H and at infinity, then f is a modular form. 


1 1 0 il 
Let's think about H/SL2(Z): this is generated by the two elements ; : and 4 | and this gives us a 


fundamental domain. And if we take the closure, this gives us a modular curve, which is closely related to the elliptic 


curve. 


Example 13 (Quadratic forms) 


Given an n EN, can we write nas a sum of k square numbers a tee wee? Denote the number of ways to do this 


S,(n). 


For example, S2(5) = 8 (we can use positive and negative numbers, and we can swap their order). It turns out 


nin=a(i+(F))E(F). 


but how are we supposed to relate this to modular forms? Well, consider the theta function 


eA=) er = yg" 


JEZ JEZ 


there's a closed form 


(where we denote g = e277). Then this is a generating function, and the q” coefficient c, of OX is the number of 
ways to write n as the sum of k squares. And it turns out that ©* is actually a modular form — just not as defined as 
above. (We'll be more precise in the future.) It has weight x which can be pretty interesting to study as well. 


And finally, how do we construct L-functions associated to a modular form? For any modular form, f(z+1) = f(z) 


1 1 
(because z + 1 is the action of f ‘ on z). So that means we can write f as a Fourier series 


co 
f(z) = S- 2,007, 
n=0 


and the Fourier coefficients a, will go into the L-function (except tossing the constant term). This turns out to also 
have nice properties — this has something to do with “eigenvalues of the Hecke operators.” We'll see how everything 


is connected in the next few months! 
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Serre 7.1 — Dhruv Rohatgi 


We'll start by talking about the modular group, its action on Hl, and its fundamental domain. 


Definition 14 
The upper half-plane is defined to be H = {z € C: Imz > 0}. 


Definition 15 


a b 
Given any ring R, we can define SL2(R) to be the multiplicative group of 2 x 2 matrices | with a,b,c,dER 


Cc 
and ad — bc = 1. (We'll be using R = Z here.) 


We can then also define an action of SL5(Z) on the upper half-plane from SL2(Z) x H to H via 


b b 
gz = (9.2) = 2 a- | | 


cz+d' c od 


We need to check that this is a group action first — we just check that g(hz) = (gh)z, which we can do with direct 


calculation. To check that gz is in H if z is in H, notice that 


az+b __ Im((az+b)(cz+d)) 


| =| 
I cz+d |cz + dl? 


Expanding the numerator gives (ad — bc)|Imz = |Imz (since ad — bc = 1), and then we're dividing by something 
positive. So this means that gz lies in the upper half-plane. 

What can we say about this group action? Notice that —/ acts trivially on the upper half-plane (because we have 
—z divided by —1). 


Definition 16 
The projective special linear group G = PSL>2(Z) is defined as SL2(Z)/{+/}. This is also called the modular 


group. 


We can consider the induced group action of this modular group on H, and a useful thing to have is a set of 


Sel ol: Th a: 


generators. If we define 


then Sz =—4 and Tz=z+1. 


Theorem 17 


S and T generate G. 


We can prove this algebraically, but a more geometric approach works well here. Define the region 


1 1 
D={zeH:—, <Rezs 5, lz|2 1. 


Other than the boundary, we'll show that this domain contains one representative from each orbit. 


Theorem 18 


We have the following facts: 


1. There's a surjective map from D — H/G. Moreover, for all z € H, there exists a g € (S,T) such that 


gzeED. 


. This map is also “mostly” injective aside from “boundary cases:” if z,z’ € D,z # z’, but z’ = gz for some 


g GG, then Re(z7)— +, and g— for |zZ/— "Vand g— S. 


. Finally, every z € D has trivial stabilizer except for /, 0, —p, where p is —$ + 7 


Why does this imply that S and T generate G? Let z be in the interior of D, and let g € G. By (1), there is 
some h € (S, T) such that h(gz) € D, and by (2) this means hgz = z. That means by (3) that hg = /, which means 
g=hte€(S,T). 


Proof. We'll first prove surjectivity: for any z € H, there exists some n such that 7”z has real part between —$ and 
5. If this point has magnitude 1 or larger, we're done. Otherwise, apply S, which increases the imaginary coordinate, 
and then apply more Ts to fix the real part again. To prove this terminates, every time we apply an S, we have a 


sequence of imaginary parts Im(z) < Im(z) <---+, where Z = ST"z and so on. But the set 
{lm(gz) :g € GA lm(gz) > Im(z)} 


is finite, because Im(gz) = aS can only be at least Im(z) if |cz+d| < 1, which can happen only for a finite number 
of (c,d). 
To show the other parts, take z, z’ € D so that z’ = gz, g € G. Without loss of generality, say that Im(z’) > Im(z), 


which tells us that |cz + d| < 1 (just like the above part). Since we're in the fundamental domain D, the imaginary 


part of Im(z) > v3 and |cz + d| > |c|lm(z), which leaves a finite number of details to check. 


Serre 7.2.1 — Vanshika Jain 


Definition 19 
A function f is weakly modular of weight 2k, k € Z if f is meromorphic (analytic everywhere except at poles) 


on H and satisfies 


F(z) = (cz + d)-**F & es 7) 


cz+d 


a b 
for all | € SL>(Z). 
G6 d 


Using the quotient rule (and the fact that ad — bc = 1), we find that 


d(gz) _ 1 
dz (ez4+-d)?" 


so the main relation of our definition can be written as 


f(gz)d(gz)* = f(z)dz*. 


One way to interpret this is to think of the differential form f(z)dz* as being invariant under action by G. Since G is 


generated by S and T, it suffices to check invariance under S and T to check the weakly modular condition. 


Proposition 20 


Let f be a meromorphic function on Hl. Then v is weakly modular of weight 2k if and only if 
1 2k 
(2-1 all) (2) a oe ee f(z) 


(corresponding to action by T and S, respectively). 


Both of these just come from plugging in the matrices for S and T into the weakly modular condition. This tells us 
that f is periodic, so we can apply the change of variables g = e°’7 to get a new function f which is meromorphic 


on the unit disk with 0 removed. Specifically, we have 


co [oe) 
f= S- ang”, f= a aero, 
—oo —oo 
Notice that as z > joo, q > 0. 


Definition 21 
If f extends to a meromorphic (resp: holomorphic) function at the origin, then f is meromorphic (resp: holo- 


morphic) at oo. 


In these cases, the infinite sum 5° anq” only needs to sum from —m to oo and 0 to ov, respectively (corresponding 


to a pole of order m and a holomorphic function, respectively). 


Definition 22 
A weakly modular function is modular if it is meromorphic at oo (that is, f has at most a pole at 0). If f is 


holomorphic at oo, we can also define the value of f at infinity via f(oo) = f(0). 


So modular functions are analytic on H except at poles, invariant under transformations of G (up to some scaling 


factors), and f’s Laurent expansion has a finite-order pole at 0. 


Definition 23 
A modular form is a modular function that is holomorphic everywhere (including 00). A modular form which has 


value 0 at infinity is called a cusp form. 


So to recap, a modular form of weight 2k can be written in the form 


[oe] co 
f(z) _ 5 ang” = 5 ae, 
n=0 n=0 


converging in the unit disk |q| < 1, satisfying f (—) = z°*f(z) — it's a cusp form if ap = 0. 


Example 24 


If we're given two modular forms f, f’ with weight 2k, 2k’, then ff’ is a modular form of weight 2k + 2k’ — we 


can check the two equations (action under T and S$) both hold. 


Example 25 


The function 
aq] [n=11°( - 9")™ = q-24q? +--- 
[ 


is a cusp form of weight 12, and it will come up later on. 


Serre 7.2.2 - Swapnil Garg 


Definition 26 
A lattice T of a finite-dimensional vector space V (of dimension 1) is a subgroup of V isomorphic to Z”. T must 


also span V, which means that it contains a basis of V. 


Example 27 (Non-example) 
Taking the set of points (a + bv2, 0) (integer combinations of 1 and 2). The result is isomorphic to Z? but 


does not span R?, so it is not a lattice. 


If we look at lattices in R?, we can generate any lattice with two basis vectors (w1, w2). To avoid double-counting, 


we'll be a bit more precise: 


Definition 28 
Let R be the set of lattices in C if we look at it as R?, and let M be the set of ordered pairs (Wi, W2) with 
Wi, W2 € C\ 0 and roel 


Observe that all lattices can then be generated by (w1, w2), so we have a surjective map from M to R. A natural 
next question to ask is when two ordered pairs generate the same lattice — this is where SL>(Z) comes in. First, we'll 


need a lemma: 


Lemma 29 

Suppose that vi = gv2, where vy = (Wi, Wo), V2 = (wy, WS) are two-dimensional (complex) vectors and g is a 
2 x 2 matrix. If g has positive determinant, then z = ms and 2) = 4 have the same sign if and only if g has 
2. 


positive determinant. 


(This was basically proven by Dhruv’s part of the lecture. ) 


Proposition 30 


Vi, V2 € M map to the same lattice in 7 if and only if v1 = gvo for some g € SL2(Z). 


Proof. The backwards direction is clear, because v1 = gv2 Is a lattice transformation — we map our basis vectors to 
other vectors in our lattice, and we have the same unit cell size. 


The forwards direction is very similar: if v1 = (wy, W2) and vo = (wy, wS), then vy = aw, + bwe and v2 = cw, + dWo. 


a 
This means that g = ‘I and having the same unit cell cize requires the determinant to be +1. By the above 
Cc 


lemma, we know that the determinant of g is positive, so g € SL2(Z). 


This means that R is isomorphic to M/SL2(Z). Now we also want to mod out by the action of C* by scalar 


multiplication (which scale, stretch, and rotate our lattices). If we consider the map from M to H sending (wi, w2) 


to “2, notice that (wi, w2) maps to the same point as (Aw1, Aw2). So Hl is bijective with M/C*, meaning pairs of 


basis vectors mod C%* are identified with the upper half-plane. And if we mod this out by SL2(Z), we find that R/C* 
is isomorphic to H/G, where G is the modular group defined earlier. (It's okay to mod out by G rather than SL(Z) 


because +1 don't do anything.) And this is why we introduce lattice — R/C* is very closely related to H/G, which is 


closely related to the fundamental domain D. 


Definition 31 


A lattice function F : R > C has weight 2k if F(A) = A~°* F(T) for all ANC C,T ER. 


We can think of F as acting on the basis vectors instead of the whole lattice — this gives us a map F: M—>C, 


where F(w1, Wo) is F of the lattice generated by (w,, w2). The lattice function condition then becomes 
F(Aw1, Aw2) = A? F (1, wa), 
so ws*F(wy, wW2) is invariant. This motivates us to define 
Wy 2k 
f(—)= PF 
(=) - w5F (wy, W). 


Also, because F is invariant under SL2(Z), we know that 


= _ _ on, [aztb 
f(z) = F(z,1) = F(az+b,cz+d) =(cez+d) “Ff (= =) 


which is exactly the modular function condition. So adding a few extra conditions means that f can be a modular 


function! 
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Serre 7.2.3 — Christian Altamirano 


Last week, we defined modular functions, and we'll be presenting some examples today. We'll start with a lemma: 


Lemma 32 


Let [ be a lattice in C. Then >~ is convergent for @ > 2, where >> denotes a sum over all nonzero 


male 
vel [ale 


elements of the lattice. 


Proof. There are two ideas we can use here. First, we can majorize the series under (a constant times) the double 


dxdy 
02 + (x2 + y2)9/2 


by writing out the double integral as a Riemann sum. Then the double integral is easy to evaluate by using polar 


integral 


coordinates. 


Another idea is to bound the points in the lattice with |y| between n and n+ 1 -— there are O(n) of these by an 


area argument because O([n + 1]? — n*) = O(n), so the infinite sum for the lattice is convergent if and only i 


convergent because 


1 a al = 
Dar SOM =k er 
n=1 n=1 


yer 


With this, we can construct an example: 


Example 33 


Let k > 1 be an integer and let [ be a lattice. Then define a function on lattices 


/ 


G(T) = >> Hpk 


velp 


From the above lemma, this is an absolutely convergent sum — this is called the Eisenstein series of index k. We 
know that our lattice [ can be defined by two complex numbers w 1, w2, and any point can be written as mw, + nwo 


(an integer linear combination), so we can instead define 


/ 
1 
Gk(W1, W2) = 
Ds (mw, + nw2)?k 


Note that the “shape” of the lattice depends mostly on z = me we can define 
! 1 

G =G 1l)= NE 

k(Z) k(Z, 1) a (mz + n)2k 


Proposition 34 


Gx(Z) is a modular form of weight 2k for all integers k > 1, and we have G;,(oo) = 2¢(2k). 


Proof. First, we show that G,(z) is weakly modular (of weight 2k). We can show this by looking at the two transfor- 


mations under T and S: recall that f is weakly modular of weight 2k If it satisfies 


f(z)=f(z+1), f (3) = z~°k F(z), 


and we can check that both of these equations hold. 
The next step is to show G,(z) is holomorphic everywhere (including at co). Let z € D (the fundamental domain 


of the modular group) — note that 
|mz + n|? > m? — mn+ n? = |mp — ni’, 
where p = e27'/3, by a direct calculation, and thus 
G(z) < > —— 
i Cl a 


is convergent by Lemma 32 — in fact, it converges normally. Now, given any z € D and any g € G = (S,T), we get 
Gx (gz), which will also converge normally (because we're transforming the domain D). Thus Gx is holomorphic in the 


upper half-plane. 


10 


Finally, we need to show that Gx is holomorphic at oo, which is the same as showing that Gx(z) has a limit as 


Z— co. We can assume that z is in the fundamental domain (meaning we take z > joo), and then 


/ / 
. 1 : 1 
dim. Do Garp = JIM, DL Gar = 262K) 
mn neZ 


because any term with a z will disappear (the denominator becomes infinity). 


The main idea of normal convergence is that a function 
‘=i 
n 


converges normally if >>, max|f,(z)| converges. Basically, it’s a very strong condition — it implies that the series 


converges at any point. 


Serre 7.3.1 — Anton Trygub 


Definition 35 


Let f be a meromorphic function on H. Then the order of f at p is the integer nso that — 


(z—p)” 
function at p and f(p) 4 0. If n is positive, then p is called a zero of f, and if n is negative, then p is called a 


is a holomorphic 


pole of f. Denote this order v,(f). 


We know that by definition, a modular function f satsifies 


F(z) = (cz +d)*F (= ts 7) 


cz+d 


a b . 
for all | € SLo(Z). This tells us that 
Cc 
V2(f) = Vazsv (F); 
cz+d 


in other words, we have Vp(f) = Ygp(f) for any p € H and g € G. So this function v is constant on every orbit of p— 
it only depends on the image of p in H/G. 

For convenience, we'll also define the order v3,(f) to be the order of q = 0 for f(q), where the function f is defined 
so that g = e2”/7 and f(q) = F(z). 


Claim 36. Let f be a modular function of weight 2k. Then f has only finitely many zeros and poles in the fundamental 
domain D = {|Re(z)| <3, |z| > 1, lm(z) > Of. 


Proof. Since f is meromorphic at q = 0, and zeros and poles are isolated for meromorphic functions, there exists a 
neighborhood of 0 where f has no zeros and poles — specifically, there is an r > O such that f has no zeros for all 
0 <|q| <r. Because f(e2"/7) = f(z), this means that f(z) does not have any zeros or poles for Im(z) > esi) So 
any zeros or poles of f in the fundamental domain are in the compact region 


log(1/r) 


D, = {x € D: Im(x) < oF 


: 


and there can only be a finite number of zeros or poles here, as desired. 
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Definition 37 


Let e, be the order of the stabilizer of the point p under G. 


From the first theorem of the chapter (where we said that we have an injective function except with a few 


exceptions), we know that 
2 p=d(i) 
f= 13 p=g9(p) 
1 otherwise 


(in each case, for some g € G). 


Theorem 38 
Let f be a modular function of weight 2k. Then 


Our previous claim tells us that there are only finitely many points p € H/G with order different from zero, so this 


definition makes sense. We can also rewrite this with our calculation of ep: 


1 k 
Yoo + suilf) + Yo(f) 4 S> (fF) = mat 
pEH/G 
pPFII,gP 


Serre 7.3.1 continued — Zack Chroman 


We'll now prove the above theorem. We'll denote the sum over p over the domain D which are not / or op with the 


symbol >>". 
The big theorem we'll need to show this is the following: 


Theorem 39 


If we integrate a function f around a closed domain A, then 


5 F(z)dz = 5 Res,(f), 
A 


271 J5 my 


where the residue Res,(f) is defined to be the coefficient a_, in the Laurent series expansion 


co 


f(z)= D> an(z—p)". 


n=—oco 


If we apply the residue theorem to the function dz, we get the following nice corollary: 


Corollary 40 (Argument principle) 
We have 


12 


We'll skip the proof of this for now — it is an exercise in expanding out the Laurent series. 
To prove the theorem we want, we will use the residue theorem to evaluate a certain integral: we take the funda- 


mental domain D, but we avoid a neighborhood of the points p, /, and o + 1 (picture taken from Serre): 


A E 


-1 —h 0 th 1 


Here, we're picking A and E to be high enough so that all of the zeros and poles in the fundamental domain are 
below line AE, and we're also making the radius of the curves from B to B’, C to C’, and D to D’ go to 0. 

We assume that there’s no poles along the parts from A to B or E to D’ — if there are, then we just avoid those 
by drawing an identical curve on the left and right. 


Call our curve C. By the residue theorem, we know that 


7 * 
[Fe = 2ni S” vp(F), 


but we can also break this up along each of the individual parts. The integral along A to B and D’ to E are identical 
by periodicity, but we're going in opposite directions, so they cancel out. 


Now, under the transformation z +> e27/2, the line AE gets mapped to a circle w around 0, and thus 


A ¢l 
f [F . 
dz= = —2T1V(f). 
[z= [5 


(The negative sign comes from the orientation of the circle.) The rest of the curves, like from B to B’, are all partial 


circles. To evaluate these, we need a lemma: 


Lemma 41 


Integrating along a small circular arc around a pole p of angle @ yields Resp(f) - a. 


The idea is that the integral all the way around the pole is 27/Res,(f), so it’s just proportional with the arc length. 


Proof. First of all, we can ignore all power series terms except the S term (everything else goes away because it 


p 
a 
| *_dz. 
arc 4 — P 


, we have dz = 2mie?™°d@ and thus our integral becomes 


integrates to 0). Thus, we want the integral 


Substituting z = p + e27/° 


x 

da_j : Fy = ; 

~_ . Dp jePll? dg = xa} - Ini, 
0 e276] 


as desired. 
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With this, note that as r + 0, the arc from B to B’ becomes an arc of angle a (because the angle between the 


vertical line and the tangent at p is 3). This means that 


and similarly 


0 1 
Finally, note that the matrix S = ; sends the arc B/C to the arc DC” (oriented in that direction). The 


definition of a modular function tells us that 


© df(Sz) C02. dF CaF 
— 72k = = ——— — 
f(Sz) =z fle) fe as =f RE +S [ Z 


ae ee ee 
= 2k 
rf 1 ff 7 Zz 
(where the signs come from us switching the order of integration), and now this is just the residue theorem on the 
TT 2mi __ 2mik 


ote ; ; 
unit circle. The angle from B’ to C goes to 75, so this evaluates to —2k -—4 a 


Putting everything together, this means that 


[ pae= 20 (otf) ! ‘elt | at a 


This means that the sum 


but we know from before that this integral is also equal to 27/ S>* v,(f). Rearranging and canceling the 277/ gives us 


the result we want. 
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Serre 7.3.2 — David Wu 


We've been considering properties of modular forms, and now we will look at the whole space of modular forms 
together. Let M, be the space of modular forms of weight 2k, and let Me be the space of cusp forms. Both are 
vector spaces over C, because the sum of holomorphic functions is holomorphic, and 0 is always a modular form of 
weight k. 


Here’s how we can relate modular forms and cusp forms: 


Proposition 42 


We have My, = M° ® CG, (which means we add a complex number times the Eisenstein series). 


Proof. Consider the map from M, to C sending f to f(co). By definition of cusp forms, the kernel of this map is 


Mp, and the image is a subspace of C (so either all of C or just 0). But we know that G,(oo) 4 0, so we can indeed 


decompose as stated. 
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The nice thing about Eisenstein series is that we can work directly with them — in contrast, we don't actually have 


that many examples of cusp forms. But one good example is 
A = g3 — 2795, 
where g2 = 60G2 and gg = 140G3. (This is an element of Mg — a cusp form of weight 12.) 


Theorem 43 
The space of modular forms M, is trivial for k <0 and k = 1. For k = 0,2,3,4,5, M, is a vector space spanned 


by 1, Go, G3, G4, Gs respectively, and MP is trivial. Also, if we multiply by A, then we have an isomorphism between 
Mk-6 and M?. 


Basically, we can classify for small values of k and then move to higher weights as well. 


Proof. Recall from last time the formula 


* 


il 1 k 
vit 5 Milf) 3 vel) D volf) = e: 


All of these orders are nonnegative because we're working with holomorphic functions f, so we must have k > 0 to 
have a nonzero function f. Also, if k = 1, then there are no terms that can give a contribution of é — the smallest 
contribution is 4. 

Also, if we have a cusp form, then Voo(f) > 1 by definition, so the left hand side is at least 1 (and therefore for 
k <6, there are no cusp forms other than 0). By Proposition 42, this means My = CGx, so Mx is indeed just a 
one-dimensional vector space. 

Finally, let’s first prove a few properties about our discriminant function A. If we apply our counting formula above 
to Go, we find that vp(Ga) = 1 and vp(G2) = 0 for all other points p. Similarly, vi(G3) = 1 and vp(G3) = 0 for all 
other p. This means that if we look at our discriminant A = g3 — 2iag, it will be nonzero at 7 (because there is a 
zero for the g3 term, but not for the go term). Therefore, A is not identically zero, and because it is a cusp form, 
Voo(A) > 1. Again applying our counting formula, all of the other vps must be zero for A, so A is nonzero on all of H. 

So now take any f € Mp, and define g = * — this is holomorphic because A is zero everywhere except at oo (where 


f already has a zero), and that means we can find the orders of the zeros of g: 


vp(f) pFo 


Vp(f)-1 p=co 


Vp(g) = 


and computing again with the counting formula shows that we do indeed end up in Mx_¢. 


Corollary 44 


We have an explicit formula for all k > 0: 


; k =1 mod 6 
dim M, = 


| +1. otherwise. 


This tells us that these vector spaces are finite-dimensional, which is nice! 
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Proof. Applying Theorem 43 gives the desired result for k = 0,1, 2,3,4,5 (verify directly). To pass to larger k, note 
that adding 6 to k makes the right-hand side of the equation just increase by 1, and by Proposition 42, 


Mk+6 _ Mise @ CGx+6 =M,® CGxi6 


(last equality from Theorem 43), which means the dimension of Mx+6 just tacks on a one-dimensional subspace. 


Corollary 45 


M«, has a basis {G4’G3, m,n € Zs0,2mM+ 3n = k}. 


Proof. First we show that this set of vectors generate the vector space, and then we show that they are linearly 
independent. If k is small, specifically for all k < 3, we can check directly that M2 is spanned by G2 and M3 is spanned 
by G3. 

For all k > 3, we again use induction: the Chicken McNugget Theorem tells us that there exist a,@ with 
2a + 36 =k. So if we compute the weight of g = G3Ge, the weight of the modular form will be k, and it is nonzero 
at oo (because Gp and G3 are nonzero at oo). So for any modular form f € Mx, there exists > € C such that f — Ag 
is a cusp form, which means 

f-—Ag=Ah, hE Mg. 


By the inductive hypotehsis, this means we can write 
f — dg = (93 — 2793)(G3G3) 


for some a, b, and expanding this out gives the appropriate result. 


To show that these monomials are independent, say that we have a dependence relation 
arGSG8 + aGe-3Gbt? 4... =0. 


Ss 
Dividing through by a power of Ge tells us that 3 satisfies a polynomial equation, so the fundamental theorem of 
3 


2 
algebra tells us that = is completely constant, which is not true (because Go is zero at p while G3 is not, and this is 
a 


not the zero function). 


So we've shown that we can classify the vector spaces of modular forms, and we have explicit formulas for the 


basis elements and dimensions of these spaces 


Serre 7.3.3 — Shreyas Balaji 


We'll talk about a specific nice modular function: 


Definition 46 


The modular invariant is given by 


To motivate the 1728 coefficient in the numerator, let’s think about the series expansion: one way we can make 


it nice is to make the residue at oo equal to 1 (because / has a simple pole at oo, which we'll show). 
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Proposition 47 


Jj is a modular function of weight 0. 


Proof. gz is a modular form of weight 2k = 4, so g3 is a modular form of weight 12. A also has weight 12, so the 


quotient has weight zero. 


Proposition 48 


J is holomorphic on H, and it has a simple pole at oo. 


Proof. Recall from the previous section that go is holomorphic on H and at oo, and A has a simple zero at oo and is 


nonzero everywhere else. Dividing will not introduce any zeros or poles except at oo. 


Proposition 49 


Jj defines by passage to quotient a bijection between H/G > C. 


Proof. Here, “passage to quotient” means we quotient everything in the upper half plane by G — for this to be valid, 
J should be equal on the equivalence classes in HI/G, but remember that j/ has weight zero (so this is well-defined). 
To show that we have a bijection, we need to show that for any » € C, there exists a unique w € H/G such that 
Jj(w) =». 
Define the function f, : H/G > C via 
f, = 172893 — dA. 


We're trying to show that A has a unique zero: we can apply the counting zeros formula to find that 


* 


Voo(t) 4 suf) | = Ye(F) t >) v(f) = = 


H/G 


where k = 6 In this case. There are a few options: we can have a zero at oo, two zeros at /, 3 zeros at p, or one zero 


at a certain point. But there's always exactly one point at which we have a zero, which is what we want. 


Proposition 50 


Let f be a meromorphic function over H. Then the following are equivalent: 


1. f is a modular function of weight 0. 


2. f is a quotient of two modular forms of the same weight. 


3. f is a rational function of /. 


Proof. (3) implies (2) implies (1), so all we need to do is to show that (1) implies (3). Let f be a modular function of 
weight O — in fact, we can assume that f is holomorphic over HI, because it starts off meromorphic with finitely many 
poles and then we can multiply by some polynomial in / to get rid of those. 

The discriminant A is zero at oo, so the function g = A"f is holomorphic at oo for some n. Now g has weight 12n 


(because f has weight 0 and A has weight 12), so we know that g is a linear combination 


g=, S Gaeae 
2a+36=6n 
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It suffices to show that each term is a rational function of / by linearity. We have 


-_ 6368 
oy 


and we know that 2a + 36 = 6n, so a is a multiple of 3 and G is a multiple of 2. Let a = 3p and B = 2q, and now 


we can write a 
P24 

f= G3 G3 
Apta 


(the denominator p + q comes from making sure f has weight 0). And now f must be a rational function of j: 


2q 
and Se are both functions of /, and we're done. 


Serre 7.4.1 — Andrew Gu 


We'll be talking about Bernoulli numbers, which will be helpful for computing some coefficients. 


Definition 51 


The Bernoulli numbers are defined by the power series expansion 


oO 2k 
Xone x ktlp * 
cay p pe| y BOR 


The infinite sum has no odd-degree terms; to check this, note that 


x xX _ x ex+1 
Malo 2\ esi)" 


which is an even function. As an example, the first few Bernoulli numbers are 


1 1 1 
1 6" 2 30' 3 42' 


The Bernoulli numbers are sometimes defined differently via 


co 
x byx* 


ex—1 kl 
k=0 


3p 
Ga 


AP 


so we don't throw away the odd-degree terms. These give the same numbers up to some small changes, but we'll just 


use the B, (because that’s what the book uses). 


Theorem 52 


We have a formula for the zeta function at even values: for all integers k > 1, 


g2k-1 


C(2k)= Kee 


Proof. Set x = 2iz in the definition of the Bernoulli numbers to find that 


2iz Of. 22k By 72k 
2iz =l-iz > : 1 
e2/zZ — 1 = (2k)! 
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Moving the iz term to the left gives 


oO 92k a 
Bx 
| 
= (2k)! 


1 Zo =F COUZ 


by expanding out the definitions of the exponentials. But we can also find a different formula for z cot z: starting with 


the Euler product 


We apply logarithmic differentiation, meaning we send f > f. This turns products into sums: 


inz — 22] (1?) 


cosz_ 1 > —2z/(n?n?) 
sinz Zz 1 
n=1 


Multiplying both sides by z yields 


co 
tz=1-2) = 
zcotz= ; 
pa ee 


This is almost a power series expansion — we'll now expand this fraction as a power series, so 


[o-s) z2/(n2n2) CO 600 72 k 
zcotz=1 2 Nea =) poe es neq 
n= 


n=1 k=1 


is valid as long as |z| < a. Exchanging the sums yields 


1 
1 >; a > n2k 
k=1 n=1 


Comparing coefficients with our two boxed equations yields exactly what we want (we are comparing power series 


around a neighborhood of 0, so they must agree). 


5 February 13, 2020 


Serre 7.4.2 — Michelle Xu 


We'll be discussing the Eisenstein series of index k (an example of a modular form) 


- 1 
Key a (nz + m)?k 


We'll be trying to express this as a Taylor expansion in terms of q = e?'77. 


Lemma 53 


For all k > 2, we have 


if if Fe = —l,n 
erm Kv sume is 
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Proof. Recall Euler's sine product formula: taking the log derivative of the expression for sin z yields 


xcotx =1-2 y ; 
men — x2 
m=1 


Set x = wz and dividie both sides by z to yield the expression 


co 
1 Z 
Tcotmz = 2 y 5 ; 
Z m—Zz 
m=1 


and we can break up the fraction into a simpler sum 


But we can rewrite the left hand side another way: 


cosm™z . qt+l 
qT cotntz = 7— =I : 
SIN TZ q-1 
because _ 
qt+1 2cos* rz + 2isin mz cos Tz COS TZ 
— = i= ’ 
q-1  —2sin? rz + 2isintzcos Tz SIN TZ 
and we can continue to simplify this via 
gpa ie (t gg) =i e 
T7——_ = ! + — ] =/n- 27 q 
Setting these equal to each other yields 
co 
= In — 2in 0 
ae. 4 
meZ n=0 


Differentiating k — 1 times (this is why we need k > 2) yields the result. 


This allows us to get to our Taylor expansion: 


Proposition 54 
For all k > 2, we have 


2k © 
Gx(z) = 2¢(2k) + 2 aD d, O2x-1(N)q", 


where ox(n) is defined to be the sum of the kth powers of divisors of d: o,(n) = panna 


Proof. We break up the sum into the contribution from n= 0 and n 40: 


/ 


Gk(z) = Ms mek athy ae (nz + mk’ 


meZ n40 meZ 


The first sum just gives us twice ¢(2k) by similar arguments to what we've done in the past, and we can pull out a 


factor of 2 in the other sum as well: 


= 2¢(2k) + 2) SS cz Ea 


n=1 meZ 
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This is what we have in the lemma, except that we replace z with nz and k with 2k. So we can rewrite our lemma as 


» a _ (2k — yi ani) ag 1 gna 


meZ a=1 


Plugging this into our Eisenstein series yields (relabeling n as d) 


Gu(z) = 26(2K) + AOA pe 1 gda_ 


Then we get a contribution to a, of a?*~! for every d|nk, which gives us what we want. 


Corollary 55 


Let E,(Z) = 14% Dop1 Cox-1(n)q", where y= (-1)k es. Then we can rewrite 


Gx(Z) = 2¢(2k) Ex (Z). 


Proof. Last time, we found that for all k > 1, 


en ab 
2k : 
This means that 
4k(2n)?k (Qin i” 


Te = { 1) SR )IC(OK) ~ (2k — 1)!¢(2k)’ 


and this gives us the constant that we want. 


We can see some examples of how this looks for different k. 
Example 56 
Bo = 35, SO we can write 


Fo(z)=1+ 240k S > o3(n)q". 


n=1 


We also know that ¢(4) = = so we can also write 


4 


4 
Go 60Gs = 1a0C es — Ea. 


In general, every Ex, is a polynomial in Ez and E3 — this is the same argument as with the G,s. For example, 


because Eq, = Ee and Es = E2E3, we can get identities like 


n-1 


07(n) = 03(n) + 120 S- o3(m)o3(n — m). 


m=1 


Serre 7.4.3 — Andrew Lin 


Recall that a modular form is holomorphic everywhere in HI, so we can express it as a Fourier series 


f(z) = f(q) = > anq” 
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where q = e?7'2. A natural question to ask about is the order of growth of the a, (to understand the contribution of 


higher-order terms). To answer this, recall that the space of modular forms of weight 2k can be decomposed via 
Mx = M2 CGx, 
where MM? is the cusp forms, and Gx is the Eisenstein series of weight 2k. We'll study these separately: 


Proposition 57 
Let k > 2. For the modular form f = Gx, we have |a,| = @(n?*~1). In other words, there exist A,B > 0 such 
that (for all n > 1) we have 


Ane lan| < Bat. 


Proof. |In the previous section, we showed that the coefficients a, (for n > 1) satisfied 
an = (—1)*Coox-1(n), 


where C is a constant (depending on k) independent of n, and o2,«~1(n) is the sum of the (2k — 1)th powers of divisors 


of n. We can bound these magnitudes both from above and below: 
|an| = Co2x—1(n) > Crk, 


and also 


|an| 1 t 
ee cy prt S >> pekat = D <0 
d\n d>1 


(in the first equality we use the fact that summing d?k-1/n?k-! = 1/(n/d)?*-? over all divisors d is the same if 


we replace n/d with d). This means we have shown that Cn?*-! < |a,| < Dn?k-1, and thus |a,| = O(n?k~+), as 


desired. 


The subspace of cusp forms is a bit more tricky: 


Theorem 58 (Hecke) 


lan| 


ae is bounded as n> oo. 


For any cusp form f of weight 2k, we have |a,| = O(n‘); that is, 


Proof. By definition, the power series expansion of f has a9 = 0, which means that as gq + O (meaning z = x + ly 


tends to joo), the magnitude of f is O(q). In other words, 


[F(z)| = O( 24) = O(c”). 


To relate the magnitude of f to y more explicitly, note that for modular forms f of weight 2k, the function 
Im(z) 
(cz+d)?' 


$(z) = |f(z)|y* is invariant under the modular group G. This is because Im(gz) = SO 


$(9z) = |F(gz)| Im(gz)* = (|F(z)||(cz + 4)|?*) - (Im(z)*(cz + d)-**) = $(2). 


But @ is continuous (on the fundamental domain), and as the imaginary part y > oo, we know that @ goes to 0 
because the exponential term e~27” dominates the polynomial term y*. This means that @ is bounded, or that for all 
zéeH, 

$(Z) <M => |f(z)| < My. 


This is helpful, because we can now extract the a, coefficient by considering the function at If we consider 


q = e2(*+!¥) for a fixed y and sending x from 0 to 1, the contour follows a circle C once counterclockwise around 
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q = 0, and thus the residue formula tells us 


1 pes E ~ | 
Qmi Jc gutt |~ 2m Je 


(using the substitution q = e2M(xtly) —s dq= 2niqdx). And we can now bound this with the inequality 


|an| = qrti 1g" 


ra)da - fers 
0 


1 
las < | |My~*q-"|dx < My—eeny 
0 


and taking y = + gives the desired result. 


Corollary 59 


If a modular form f of weight 2k is not a cusp form, then the coefficients have order of magnitude neko} (that is, 


| an] = O(n?*~*)). 


Proof. Such modular forms of weight 2k can be written as cG, + h, where h is a cusp form and c € C £0. G,'s 


coefficients are O(n2*—!) while h’s are O(n‘), so the Gx coefficients dominate for large n. 


Remark 60. Work by Pierre Deligne has shown that Theorem 58 can be improved: it has been shown that a, = 
O(n‘—'/2g9(n)), where oo(n) denotes the number of divisors of n. Since o9(n) is subpolynomial, this tells us that for 


alle > 0, we have the stronger bound a, = O(nk- 1/24), 


Serre 7.4.4 — Nikhil Reddy 


Today, we'll talk a bit about the discriminant 
A = 93 — 2795, 


where go = 60G2 and g3 = 140G3. To get some of the constants to work out a bit better, we can also write this in 
terms of Es and E3: because 
go = 120€(4)E2, 93 = 280¢(6) Es, 


we can substitute these values in to find that 
he (2m) (12) (Bs Es). 


We know the q-series for E> and E3, so we can directly compute the first few coefficients (which are all positive 


integers): 


A = (2n)'?(q — 24q? + 252q° — 1472q*+---). 


This power series may look a bit familiar: 


Theorem 61 
We have 


A @n)-¢ |G a) 


This proof is a “bit artificial” because the natural method is to use elliptic curves. 


23 


Proof. Let F = qJ[?2,(1- q”)**: it's enough to show that F is a cusp of weight 12, and then check the coefficient 
of the q term to figure out the scaling factor (because the space of cusp forms of weight 12 has dimension 1). 


Define the two series (prime sum means we ignore (0, 0)) 


’ 1 : 1 
Gi(z) = wy (m4nzye’ G(z) = DOS rer 


Because the double sum is not absolutely summable, the order of summation here is important. Note that G, is not 


a modular form: 


Proposition 62 


We have 


Gy(2) = 2 — 22m)? ox(na" 


n=1 


il 
Gi (-3) = 2°G1(z) — 2niz. 


Proof. Earlier, we showed that for k > 2, 


G,(Z) = 2¢(2k) + 2 ony D, 024- i(n)q". 


The proof basically follows the same way if you plug in k = 1 instead and sum in the correct order (as we do). 


We'll postpone the second identity for the end. 


To show that F has weight 12, we need to show that 


We can write this more nicely as 


dq{l ig” 
- 8 (2-20 
q\q peril aoe 
and then expand as a geometric sequence to find 


= = (: — 24 3 vs aq “t _ 24S ox(n)a" ) 


nym=1 n=1 


Using the first part of Proposition 62 yields 


and now a = 2m idz tells us that 
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Plugging in —4, we find that 


and now by the second part of Proposition 62, this is 


Qn 


_ 12 2°G1(z) — 2riz a 12: 
2m 2? 


12 
G(z) + =) dz. 


But now this means that 


which means that F(—4) = kz!#F(z) by reversing the logarithmic derivative. Now looking at z = /, we find that 
F(i) = kF(i), so k = 1 (because F(/) £0). So indeed F is a modular form of weight 12, as desired. (Checking the 


constant term between F and A just comes from looking at the g-term.) 


We can now prove the second identity 
1 > : 
Gi ee ae G(z) — 2miz. 


We will need to introduce two more series 


’ 1 ! 1 
Mz) = DD (m—1+nz)(m+ nz)’ ney 2 (m—1+nz)(m+ nz) 


where we avoid both (1,0) and (0,0). These two series can be computed directly, because the series telescopes via 


1 1 1 
(m—-1+nz)(m+nz) m-14+nz m+nz 


So for H;, we have the contribution from n ¥ 0 giving us 


1 
H(z) > (m—1+nz)(m-+ nz) = 


and the n = 0 contribution just gives us 2. A more complicated calculation gives us H(z) = 2— 2m and now it turns 


out that i 
G, ( —= ) = z°G(z). 
1( ;) z°G(z) 


This is true because of how we usually write series — plugging in 4 brings up the z to the numerator but swaps the 
order of summation. So now 


/ 


oe SEE DEEL 
— (m+nz)? (m—1+nz)(m+ nz) — (nit nz)?(m—1+n2) 
is absolutely convergent (order 3), so we can add it to G and G; to find that 
271 


G-H=G,- —— SG SP 


which yields the result because 


Z 


Gi (-z) =e" (Gu) — a = z*G(z) — 2niz. 
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Serre 7.4.5 — Michael Tang 


We'll be doing some cleanup from section 2.3 (lattices and elliptic curves), and then we'll talk a bit more about the 


coefficients of A. 


Definition 63 


Let T be a lattice in the complex plane (I is isomorphic to Z? and spans C over real linear combinations). Then 


the Weierstrass p-function 


1 il 
ot = ath (aarp 


Male 


We're going to show that this function and its derivative satisfy the equation of an elliptic curve. To do this, we 


start with the Laurent expansion: 


Proposition 64 


The Laurent expansion of gr is 


pr(u) = + 22k NG(M PE, 
k=2 


where Gx(T) = yer air: 


Sketch. There's a lot of algebra, and we can look at the 18.783 lecture notes for more details. We can expand out 


G, in the proposition as a sum as well to get a double summation: 
Lo Ag (2k — 1)u2k-? 
ue + > ry2k 
k=2 yer 

This converges absolutely, so we can swap the order of summation 
i I 00 y2k-2 
=a 1) aK 

yer k=2 


For each k, this is an “arithmetico-geometric series” which we can evaluate directly. But to get the exact form of the 


Weierstrass p function, we need to do a trick: we actually add in the “odd terms” with -y2**! back in (they cancel out 


because y” cancels with (—y)"), and that will give us what we want. 


Proposition 65 
Let x = gr(u) and y = g-(u). Then 


y? = 4x? — gox — gs, 


where go = 60G2(T) and g3 = 140G3(T). 


Sketch. Define the function 
F(u) = y? — (4x? — gox — gs): 
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write out the Laurent expansion and bash, and we'll see that the negative-exponent terms cancel out, and also 
F(0) =0. This means that F is a holomorphic function at z = 0. 

Now gr and gf are both periodic with respect to the lattice, so F is also periodic. And this means that F is 
holomorphic at all points of the lattice [: a uniform convergence argument shows that F, gr, @} are all holomorphic 
at all points not on the lattice, so F is also holomorphic everywhere. 


Now F is periodic with no poles and entire, so it is bounded by the supremum on one (compact) fundamental 


domain, so it must be constant by Liouville, and thus F = 0 everywhere. 


This is the Weierstrass form for elliptic curves — we can prove that it is nonsingular, so there is something about 
an isomorphism between curves and lattices (though we don’t have the background). 
Next, we'll go back to the modular form 
A = g3 — 2793. 


Recall a few important properties here: A has weight 12, and it’s a cusp form (this explains the coefficient 27). Also, 


it has the g-expansion 


A = (2m)"q][Q-4")*. 


Definition 66 


The Ramanujan 7 function is defined such that 


Co = ae 


We can check some small values: 7(1) = 1, 7(2) = —24,7(3) = 252, 7(4) = —1472 grow pretty quickly, but we 
can give a bound on the coefficients. Recall that a cusp form of weight k has q-coefficients that are O(n‘) (and in 
fact O(n*k-/2+®). So in this case, T(n) = O(n®) and actually O(n1!/2+*), 


The 7 function has nice properties, but we'll prove them later on: 


Proposition 67 


We have the following results: 


+ T is multiplicative: for any m,n relatively prime, T(mn) = T(m)rT(n). (This means we only need the values 


at prime powers.) 


+ We have the second-order recursion (for prime p and n> 1) 
Tp) —=t(p)a(p)— pp) 


» We can write the Dirichlet series L(7, 5) = )7?°, ue as a product 


1 
Bue = Il 1—1(p)p-s + pil—25° 


pprime 


It's conjectured (unproven) that 7(n) 4 0 for all n, but this has been checked up to about 8 x 10?9. 
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Serre 7.5.1 — Natalie Stewart 


We'll start by redefining a few familiar concepts: 


Definition 68 
Let E be a set. Then Xe denotes the free abelian group on E: 


Xe= » Cxx|C € Z, all but finitely many ro] ; 
xe 


Definition 69 


A correspondence on a set E is an endomorphism 7 : Xe > Xe — an equivalent way to think about this is that 


we can write 


T(x) = oor yyy, 


yee 


which is equivalent to a set function ny : E + N® with finite support. 


Definition 70 
Let f be a function from a set E to a group G. Define 7 f : E + G as the composition 


EE Gaete oeeres 


Note that the endomorphisms on (say) an abelian group can be added pointwise and multiplied via composition, 


so they have a structure like a ring. We'll be defining a bunch of correspondences which basically form a subring. 


Definition 71 
Let 7 be the set of lattices on C. Define the correspondence T(n) by 


Ah ye. 
ER 
(Cr jn 
for all n > 1. For all A € C*, define the map from Xp to XR via 


RF =1-(AN), 


where (AI) denotes the homothetic rescaling of ’. 


Note that if [’ € T (is a sublattice) has index n, then [’ will contain nl. This gives us a chain of inclusions, and 
{r'\(P 2 1') =n} = {G €F/nl||G| = n}. 


We'll be using this a lot in the upcoming proof. This reduces the job of counting the subgroups of a certain order for 


the group ['/nl. We'll use the fact that if n is a prime, then this number is actually n+ 1. 
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Proposition 72 
For all A, uw € C* and n,m € Zyj, we have the following: 


1. Homothety operators multiply: RyRy = Rap. 
2. RxT(n) = T(n)R, (we have commutativity). 


3. T(n)T(m) = T(nm) if n, m are relatively prime. 


AS Ip (p= fp) —pilp = )R, it pis-a prime: 


Proof. (1) and (2) are clear (arguments about scaling). 

For (3), note that if (F : 1”) = nm, then the coefficient of [” in T(m)T(n)F comes from counting the lattices I’ 
where [” CT’ CF and (f : 1’) =n, and by an isomorphism theorem this is just the number of subgroups G of F/T” 
with |G| = m. Since m,n are relatively prime, it’s sufficient to check that the coefficient of [” in T(n)T(m)I is 1, 
which means there is a unique subgroup of [/T” of order m (this is true). 

For (4), note that a sublattice is represented only if its index is p”*!, so we can just check that the coefficients are 
the same there. Consider [” such that (F : 7”) = p+: from an earlier argument, I’ with index p must contain pI. 

Suppose that the coefficient of [” in T(p")T(p) is a, and the coefficient in T(p”~!)Rp is c. we wish to show that 
a=1+ pc. We break into cases: 

(case i) If [” is not contained in pl, then c = 0, and we want to show that a = 1. Note that |/pl'| = p?, so 
the order of the image of the projection from T into [/pl is p. Also, because [” is not contained in pI, it doesn’t 
intersect at all, and the order of the image of [” is also p. Since the image of I” is contained in the image of I’, we 
can use the uniqueness part of the isomorphism theorem to show that we have a bijective map between subgroups — 
there exists a unique I’. 


(case ii) If [” is contained in pI, which is contained in [’, we can check that c = 1 and a = p+1 (we did this 


above). 


Corollary 73 


Inductively, we can show that the prime powers T(p”) are polynomials in the underlying prime T(p) and the 


homothety operator Rp. Also, the ring generated by 7T(p) and Ry for p prime and A contains T(n) for all n > 1 


and its commutative. 


One important idea is that these correspondence act on our lattice functions: recall that a lattice function of 
weight 2k from FR to C satisfies 
FOI) =A “FO. 


Using the homothety operator, we can write that 
F(R,T) = 7“ F(P), 


so T(n)F is also a lattice function of weight 2k. And this gives us an extra characterization similar to the above 
proposition: 
T(m)T(n)F =T(mn)F,  T(p")T(p)F = T(p)F + pt XT (pF. 
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Serre 7.5.2 — Kaarel Haenni 


We'll be talking about sublattices of a particular lattice. Throughout this, we'll be fixing a lattice F with basis wy, We: 
define ['(n) to be the set of sublattices of T of index n. 


Definition 74 


b 
Let S, be the set of integer matrices : ] with ad=n,a>1,and0<b<d. 


Definition 75 


a b 
For any 0 = a ; , define the sublattice [, be the sublattice with basis wi = aw; + bwo and wh = dwn. 


Theorem 76 
The map o > I is a bijection from S, to F(n). 


Proof. First of all, we check that the index is indeed n: this is equal to the ratio of areas of a fundamental cell of T, 
to T, which is indeed det o = ad = n. (These two equalities follow by looking at the “cross product” of two vectors in 
the complex plane and considering the number of lattice points in one fundamental domain.) 
To show that we have a bijection, we construct an inverse map sending a sublattice [’ to a(I’). Define the two 
groups 
Y,=0/(0' + Zws), Yo = Zw/(l' OM Zwe). 


Both of these are cyclic groups, generated by wy , We in the quotient maps. Let a and d be the orders of these cyclic 
QrOUpS Wy, W2. Now defining w, = dw, we know that w, € IT’. Also, by definition of a, we know that there is some 


unique b such that aw, + bw2 € I’ as long as 0 < b<d. Then let w, = aw + bw2. Now we have a, b, d and we can 


a b 
0 dl. 


Showing that these maps are inverses is routine, so we'll skip it here. 


just define 


o(l’) = 


Example 77 


Let p be a prime. Then the sublattices of [ of index p can be easily characterized: S, consists of the matrices 


for allO <b< p. 


(This shows that the statement from Natalie's lecture that there are p+1 sublattices of index p.) These correspond 
to the sublattices wi = pw1,W = We (scaling one variable by p) and wi = Ww + bw2, ws = Pwo (scaling the other by 


p and doing a shear). 
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7 February 27, 2020 


Serre 7.5.3, 7.5.4 — Michelle Xu 


We recently introduced the operator T(n), which acts on lattices as follows: 
Tiot= >. i, 
(P:l)=n 
Today, we'll talk about this in the context of modular functions. 


Recall that we have a map between lattice functions F of weight 2k and weakly modular functions f of weight 


2k, with the correspondence 


wk F(T (wi, w2)) = F (=) 


W9 


Definition 78 


T(n) acts on functions f with the relation 


T(n)f(z) = fn “Te, 1). 


(The n?k-? is there for aesthetic reasons). Remember that the set of sublattices of index n for a lattice [(wy, w2) 


wy} ja b Wy 
wh, 0 d| |omegas|’ 
where ad =n,0< b<d,a>1. This means we can also rewrite our equation above as 


TOR SO Mae > ae (=e *) 


(F:l)=n a,d,b 


can be represented with the set of bases 


(expressing in terms of the bases). 


Proposition 79 


Let f be a weakly modular function of weight 2k. Then the function T(n)f is also weakly modular of weight 2k, 


and it is holomorphic if f is. 


Proof. Remember that the action on SL2(Z) on our lattice function keeps it invariant, so 
T(ajf(z) =" T (Fz) Hn" Ta laz +b, cz +), 


and converting this back to our weakly modular functions gives us 


az+b 
cz+d/]' 


=(cz4-d) "Taf ( 


so this is indeed of weight 2k. And now we have a finite number of terms in the boxed equation above, each of which 


is meromorphic (because f is meromorphic). So T(n)f is also meromorphic, and it is holomorphic if f is (for the same 


reason). 
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Proposition 80 


Our operator T(n) satisfies the following equations: 


+ T(n)T(m)f = T(mn)f for m,n relatively prime. 


« T(p)T(p")f = T(p"*!)f + p**1T(p"*)F for p prime and n> 1. 


Proof. Recall that we proved a very similar set of relations for lattice functions — we just plug this in along with our 
definition of an operator. 


The main point is that the p2~! factor is different; this mostly comes from the n2‘~! definition of our operator 


on modular functions. 


Proposition 81 
Let f be a modular function of weight 2k (so it is also meromorphic at infinity) with Laurent expansion f(z) = 


Dimez e(m)q™. Then T(n)f is also a modular function, we can write 


T(n)f = >) y(m)a”, 


meZ 


where y(m) = Saito en) a aG ( 


Proof. We expand f with its Laurent series to write 


T(n)f = n2k-1 s d72k Ne Elerion. 


ad=n meZ 


(here we have expanded g = e27/7). 
Fix d and m, and consider the sum Svy<peg €77/"/4. If dlm, then the exponent is a multiple of 27/, so this sum 
is just d. Otherwise, we have a sum over the roots of unity, which is 0. To use this, we define a = m’, and then we 


can write 
T(n)f = n2k-l Se de etm dig?” 


ad=n 
a>l 
meZ 


To simplify further, let «4 = am’, and now we just have a single exponent for q: 
ny 2k-1 pd 
ronte)= Soe 3 (2) (EA). 
ime=De Y (3) <(&) 
uEeZ al|(n,4),a>1 


And now this is basically what we want by relabeling indices — we just need to show T(n)f is meromorphic. Because f 
is meromorphic, its Laurent expansion stops at some N: c(m) = 0 for all m< N. This means that c (42) = 0 for all 


lu < —nWN (because the largest ¢ can be is n). This means that f is meromorphic at infinity, and because it is weakly 


modular (from earlier arguments), 7 (n)f is indeed a modular function, as desired. 


So T(n) brings modular functions to modular functions, which is nice. 
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Corollary 82 


Using the same notation as above, we have (0) = d2«_1()c(0), where o2,_1(n) is the sum of the (2k — 1)th 


divisors of n, and y(1) = c(n). Also, if n = p is prime, then we have 


c(pm) m0 (mod p) 


tom +e-te(#) m=0 (red) 


(These can be verified by directly plugging in m= 0,1 and n= p and using divisibility properties. ) 


Corollary 83 


If f is a modular form (resp. cusp form), then 7(n)f is modular (resp. cusp) as well. 


Proof. The argument follows analogously as above, replacing “meromorphic at infinity” with “holomorphic at infinity.” 


Cusp form follows from the fact that if c(0) = 0, then (0) = 0 from the above corollary. 


With this, we'll study something more specific about our T(n): eigenvalues and eigenfunctions. We'll assume that 


f = )°P 4 ¢(n)q” is a modular form of weight 2k for some k > 0. 


Definition 84 
f is an eigenfunction for all 7 (n) if we have A(n) € C such that 


T(n)f = A(n)f. 


(Then the A(n) are the set of eigenvalues for T(n).) 


Theorem 85 
Let f = So? c(n)q” be an eigenfunction. Then the coefficient of c(1) is nonzero, and if we normalize c(1) = 1, 


we have c(n) = A(n). 


Proof. We know that the coefficient of g for T(n)f, which is y(1), is equal to c(n). But by the definition of the 
eigenfunction, T(n)f should have q-coefficient A(n)c(1), so c(n) = A(n)c(1). Then c(1) can’t be zero, because 
c(n) = 0 for all n > 1 (and that means f is a constant, and the only modular form for k > 0 is trivial). Thus c(1) 4 0, 


and the second claim follows easily by setting c(1) = 1. 


Corollary 86 


If two modular forms of weight 2k are eigenfunctions of T(n) with the same eigenvalues A(n), and they are both 


normalized (to c(1) = 1), then they coincide. 


(This is because the Laurent expansion is completely defined by the eigenvalues. ) 


This allows us to say something more specific about our eigenfunctions f: 
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Corollary 87 


Suppose that f is an eigenfunction of all T(n), and it is normalized to c(1) = 1. Then 


c(m)c(n) =c(mn) YV(m,n) = 1, 


e(p)c(p") = e(p"**) + p***c(p*). 


(We plug in the properties of our operator T(n) into the eigenfunction statement. ) 


Definition 88 


Let f be an eigenfunction. Then 


O(s)= >> ay 


is the Dirichlet series defined by the coefficients of an eigenfunction f. 


This series converges absolutely for all Res > 2k, because we proved that modular forms’ coefficients have order 


of magnitude c(n) = O(n?*—?), 


Corollary 89 
Let P be the set of prime numbers. Then 


il 


Or(s) = II = 2k—1—2s ° 
pep 1 C(P)P™S + P : 


Proof. First, we want to show that we can write our Dirichlet series as 
lo) 
(s) =|] bs cot) 
peP \n=0 


(This proof comes from Serre chapter 6.) Let S be a finite set of prime numbers, and let N(S) be the set of integers 


with prime factors only in S. We know that coefficients are multiplicative, so we can rewrite 


3 cn) = I] (> como) ; 


néN(s) pes \m=0 


Now as S approaches the set of all primes, our infinite product will converge to the result above. From this, we just 


need to show the inner term works out: in other words, for all primes p, we want to show (defining Q = p-°) 


co 


n n __ 1 
2 PO = eG IGE 


To do this, we consider the series 


b(n) = (D5 e(p")Q")(1 — c(p)Q + p***Q"); 


n=0 


we wish to show that this is equal to 1. Now the Q-coefficient is equal to c(p) — c(p) = 0, and Q”*? for all n > 1 


has coefficient (expanding out the relevant terms) 
ete) epee) +p ale? *), 
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which is zero because of the recursive equation for c(p"). And this means that we only have a constant term, and 


w(n) = c(1) = 1 by normalization, completing the proof. 


Hecke also proved a few interesting facts about &,(f) — this will have a meromorphic continuation over the plane, 


just like the Riemann zeta function. 


Serre 7.5.5, 7.5.6 — Shreyas Balaji 


Throughout this section, “normalized eigenfunction” means c(1) = 1. We'll do a few examples of eigenfunctions for 


the Hecke operators f(n) first: 


Proposition 90 
The Eisenstein series G;, is an eigenfunction of T(n) with eigenvalue > = d24_1(n). Specifically, the normalized 


eigenfunction Is 
co 


B B 
ee Gl et ea, 
n=1 


where B, are the Bernoulli numbers. 


Proof. Let G;,(T) be the function from the set of lattices R to the complex numbers, where 


/ 


1 
G(T) = x, Hpk 
yer 
We can just show that Eisenstein series are eigenfunctions for all T(p) for prime p (and then use the relations to build 
up to all T(n)). Then for any prime p, 
/ 
1 
T(p)GK(T) = SY) dose. 
(T:P’)=p yer Y 
Consider a particular y € T: then there are two cases. (1) We have y € pI (that is, the lattice vectors scaled by p). 
There are p+ 1 sublattices that have index p in [, and if it’s inside pI, it’s inside all of the sublattices as well. (2) + 


is contained in exactly one sublattice. So we can split these up to find that 
T(p)Gx(T) = Ge(V) + pGx(pl) 


(the first term from every point being counted once, and the second coming from points that are counted p+1 times). 
And note that G;,(pl) = y~?*G, (I), so 


T(p)Gk(T) = (1 + p'-7*)G, (1), 


which shows that G,(I), viewed as a function on lattices, is an eigenfunction for the operator T(p). And now 
this means the modular form G, associated with the function G;(I) is an eigenfunction under T(p) with eigenvalue 


p?k-1(1 + p'~?*) = oox-1(p), as desired. (We pick up a p**—! factor when we convert between modular functions 


and modular forms.) 


Proposition 91 


Let f be defined as above. Then the Dirichlet series ®¢(s) = C(s)¢(s — 2k +1). 
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Proof. We can write out the coefficients 


oo 2k—-1 
o (s) => Pt _ 3 a 
rae ns asds ° 
n=1 a,d>1 


We can now separate this into two different sums: 


(Ga) (Se) 


which is exactly what we want. 


Proposition 92 


The A function is an eigenfunction of T(n) with normalized eigenfunction 


co 


Coo) Se ae 


n=1 


Proof. Remember that the space of cusp forms of weight 12 is of dimension 1, and this space is stable under action 


of T(n). Thus T(n)A must be some constant times A, and we can check the coefficients for normalization. L] 


Corollary 93 
The 7 function satisfies T(nm) = T(n)r(m) for all relatively prime n,m, and T(p)t(p") = T(p"t!) + p'1(p?-?) 


for all prime p and n> 1. 


(This follows from directly applying results from above about the coefficients c(n) of eigenfunctions.) 


We'll now move on to complements: we will be stating but not proving a lot of results. 


Definition 94 


Let f, g be cusp forms of weight 2k. Then define a measure pu via 
du(f, 9) = f(z)g9(z)y**dxdy/y’, 


where x, y are the real and imaginary part of z. 


Note that py is invariant under action by G (because of the y?* term) and it is bounded over H/G (because f, g 


are bounded and f and g fall off exponentially). 


Definition 95 


The Petersson scalar product on M02 is defined as 


(f, 9) =e au(f.a) = | F(z)al@)y**dxay 


where D is the fundamental domain. 
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Fact 96 


The operators T(n) turn out to be Hermitian under the Petersson scalar product: we have 


(T(n)f, 9) = (Ff, T(n)g). 


Since T(n) commute with each other, the operators are simultaneously diagonalizable. Furthermore, the image of 
f is contained within the span of the T(n) eigenvectors (the operators are complete), so there exists an orthogonal 
basis of MM° made of eigenvectors of 7(n) with real eigenvalues (because the operators are Hermitian). 


We'll move on to something completely unrelated: 


Definition 97 


Let M,(Z) be the set of weight 2k modular forms which can be written as f = Deere Cnq” for c, integers. 


It turns out that there’s a specific Z-basis of M,(Z) (it must exist because the space is stable under action of 


T(n)), which extends to a C-basis of M,. Specifically, we pick 
{ESFF : a+ 36 = k/2} 


for k even, and 
{E3ESF° > a +36 = (k — 3)/2} 
for k odd. The nice thing is that this Z-basis extends to a C-basis, so the coefficients of the characteristic polynomial 


of 7(n) must be integers (because they come from a Z-basis). 


Fact 98 (Recently proven Petersson conjecture) 


Let f = 7,5, ¢(7)q” be a cusp form of weight 2k which is also a normalized eigenfunction for T(n). Define 
; p(T) =1—c(p)T + p17? : 


then we can factor this polynomial as (1 — apT)(1— a,T), where ap + a, = c(p), apa’, = p*X?. Then ap, a, 


are complex conjugates. 


8 March 3, 2020 


Serre 7.6.1-7.6.4 — Vanshika Jain 


We'll start with a few definitions that we'll need throughout the rest of this lecture: 


Definition 99 


An invariant measure sz iS a measure preserved under translation and rotation. 


For example, the normal “product measure” dx,dx»--- dx, iS an invariant measure. 
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Definition 100 


The dual of a vector space V is the space of linear functionals (linear maps from V to the scalar field) on V, with 


structure of pointwise addition and scalar multiplication. 


Definition 101 
A rapidly decreasing (smooth) function is a function f(x) such that f, f’, f”,--- exist everywhere and decay 


faster than any negative power of x. 


(f(x) = e™ is such an example.) Note that the space of rapidly decreasing, smooth functions have the property 
that the Fourier transform is an automorphism of the space. 
Let V be a real vector space of dimension n, equipped with an invariant measure ww. Let V’ be the dual of V, and 


let f be a rapidly decreasing smooth function of V. Then the Fourier transform of f is defined to be 
Fy) = f ere equtx), 
V 


If we let F be a lattice in V, and let [’ be the dual lattice in V’ (that is, the set of y € V’ such that (x, y) = y(x) is 


an integer). Since V is a finite-dimensional vector space, there exists an isomorphism between V and V’ given by 
ve> (v,-) 
for an inner product (-,-). 


Example 102 
If we let V = R", F = Z", the dual vector space is isomorphic to R", and the lattice I’ is just Z” if we work with 


the usual dot product. 


Proposition 103 


Let v = u(V/T) (the measure of one fundamental region of [), and let f be a rapidly decreasing smooth function. 


Then 


LFW == YF. 


xer yer’ 


Proof. Replace with a scalar multiple so that the measure of the fundamental domain becomes 1. If we take a basis 
€1,--- ,@, of F, we can then identify V with R”, [ with Z”", and take pz to be the ordinary product measure. Then the 


formula we're trying to show just reduces to the classical Poisson summation formula 


Y A= + FOO, 


xeZn xeZn 


which is a property of the Fourier transform. 


We'll now apply this to quadratic forms: suppose that V is equipped with a symmetric bilinear form (x, y) that is 
positive and nondegenerate. As before, this defines an isomorphism between V and V’, so the dual lattice [’ can be 


thought of as a lattice in V instead of V’: a point y € I’ if and only if (x, y) is an integer for all x € [. 
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Definition 104 


Associate a function to each lattice TF via 


Cahn 


xer 


Proposition 105 
We have the relation (for all t € Rso and lattices I) 


Or(t) = t"/?v Or (t). 


Proof. Let f(x) = e—™**): this is rapidly decreasing and smooth, so if we pick an orthonormal basis to identify V 


with IR” and uz with the product measure, we have 
f= en M Hat x) 


Since the Fourier transform e~™° is itself, we can use the previous result: note that t'/2P has dual lattice t71/21’, 


and the volume of our lattice is t”/2v. Thus 


S- f(x) = > en Met +x) = > en ttlxp + +x7) = Or(t). 


xetl/2p xet/2r xe. 


But we can use the Poisson summation formula and do the same trick with t~!/I"’ to find that this is also equal to 
t-"/2y-1©,» (t+), as desired. 


Everything with © here can be represented using a matrix: if we let e,,--- , e, be a basis for our lattice [, we can 
define ajj = (gj, ej), which gives us a positive, symmetric, nondegenerate matrix A = (Ajj). Thus, for any x € V, we 


can write 
(x, x) = 5 aijXiXj, 
ij 


so we can write our function 


Or(t) = So er tt Dns a9, 


xjEZ 
Defining the determinant using the wedge product, we can see that the volume v of T is the square root of det A. 


Furthermore, if B = (Bij) is the inverse matrix, we have a dual basis 
cH = > bij e;. 
J 


Then the e; form a basis of T’ — we have (e/, e') = Bj. In particular, this tells us that if v’ = u(V/T"), we have 
vv = 1. 

For the upcoming section, we'll be dealing with some special cases — pairs (V,T) with a few nice properties. First 
of all, we want [’ to be equal to T, which is equivalent to saying that (x, y) is an integer for all x,y € T. This also 
implies that [’’s matrix A has integer coefficients, and A has determinant 1. 


We'll also assume that (x, x) is always even — this means that T is of type II. We'll see why this is useful soon. 
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Serre 7.6.5-7.6.7 — Andrew Lin 


In these final sections of the book, we'll discuss a particular kind of modular form with particularly nice properties. 
Much of the background work has already been done: throughout this lecture, we'll assume that we have a lattice T 
on a vector space V of dimension n, such that the fundamental domain has volume 1 and the diagonal entries of the 


matrix Ajj are even (we have a type II lattice). Since any x € T can be written as a Z-combination of basis elements 


(x, X) = pie oF 250 xixjaij 
i 


IJ 


>>, xe7, we know that 


is always even for a type II lattice. 


Definition 106 


For a lattice T, let r-(m) be the number of elements x € TF such that (x, x) = 2m. 


We know that our bilinear form is positive and nondegenerate, and thus it is essentially a rescaling of the dot 
product (because we can diagonalize the matrix A that defines it). Thus, r-(m) is bounded by a constant (depending 
on [) times the number of points in Z” of distance at most 2m from the origin. Such points are contained in the 


n-dimensional box of side length 2\/2m: thus r-(m) = O(m"’?) for any F, so r-(m) is polynomial in m. This means 


S- rr(m)q” 


m>0 


the infinite sum 


is convergent for all |q| < 1. (It’s good to make sure it’s clear why the constant term r-(0) = 1 for any lattice I.) 
Here's where we'll connect this back to our modular forms: notice that what we've written down is the Fourier 


expansion for a modular form on the upper half-plane, where g = e2/2. 


Definition 107 


The theta function associated to a lattice T is 


6-(z) = De rr (m)e2"'"2 = S- eMiZ0GX)_ 


m>=0 xer 


Because this sum is absolutely convergent and we have no poles, Or is holomorphic on the upper half-plane. 


Theorem 108 


Suppose [ satisfies the above assumptions. Then 


(a) nis a multiple of 8, 


(b) @r is a modular form of weight 4. 


Proof. The proof of the first point is outside the scope of this lecture !, but we'll do a somewhat circular proof of (a) 
once we prove (b). 


We know that 6 is holomorphic, and it satisfies the relation under 7 because we've written it as a Fourier series. 


1See chapter 5 of Serre. The main idea is that the signature of our bilinear form can be essentially related to a canonical element of 
the dual lattice of (T/2T)”, if we look at images of the dot product mod 8. 
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Thus, it suffices to check the relation under S: to do this, we will instead check that 


6r (-2) = (iz)"/?6r(z) | 


(The extra i”/? factor cancels out if n is a multiple of 8.) To show this result, note that both sides are holomorphic on 
the entire half-plane, so it suffices to show that they are equal on a set containing a limit point. Specifically, plugging 
er !Z(x.x)) 


in z = it, note that the left hand side evaluates to (using 6r(r) = dover 


while the right hand side is 
(tyre) =" S$" 6 4 O(7), 
xe. 

(Here, we use the definition of Q(z) = 37 e~™**” from a previous section.) These expressions are indeed equal (see 
Serre 7.6.2) as long as the volume v associated to our lattice is 1, which is one of our assumptions. Thus we've shown 
the boxed result, and thus the S-relation holds: @r is indeed a modular form. 

With this, we can verify that n must be a multiple of 8 (in a circular manner). Suppose not (for the sake of 
contradiction); then we can assume n = 4 (mod 8) by replacing [ with T @T (either once or twice) and still have a 


(theoretically) valid lattice. The boxed equation then becomes 


Or (-z) = —z"/29,(z), 


so the differential form w = (dz)"/*6-(z) is sent to 


(a(-3))""* a ( oe 2/?6¢(z) = —w 


Z Zz 


under S. But w is invariant under T, so ST transforms w into —w. This is a contradiction with the fact that (ST)? 


is the identity, and thus we must have had n be a multiple of 8 from the start. 


Corollary 109 


Given any lattice satisfying our assumptions, there exists a cusp form fr of weight 2k = 5 such that 6 = Ex + fr. 


Proof. Both @- and Ex, have constant term 1 in their g-expansions, so their difference is a cusp form. 


Taking this result and directly reading off coefficients yields the following: 


Corollary 110 


For any lattice [ satisfying our assumptions, we have 


rr(m) = Fo 24-1(1m) + O(m*), 


again taking k = 7. 


(Here, we need to use that the coefficients |a,| of a cusp form are O(m*), and also that k must be even.) 
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Fact 111 
In general, the cusp forms fF are not identically zero. However, Siegel has shown that a weighted mean of the ffs 
is zero: If C, is the set of isomorphism classes of our T's, and nr is the order of the automorphism group of I in 


C,, then 


Another way to say this is that (because ff = 6 — Ex) 


1 
S —oO = ( 
Ar 
TECn TECn 


Since E, is an eigenfunction of our Hecke operators T(1n), so is this weighted mean. 


With this, we can look at a few concrete examples. 
+ The smallest n where a lattice with our desired properties exists is n = 8. The theta functions then correspond 


to modular forms of weight 4, but there are no nonzero cusp forms of weight 4. Thus, we must have 


6r(z) = Eo(z) =1+ $ © 24003(m)q” 


m>1 


for any 8-dimensional lattice [. We have a matrix representation for such a lattice: 


2 —1 

2 —1 
= 2 —-l1 

—1 —1 2 =1 

Tg = 
=| 822 =i 
-1 2 -1 
=- 2? il 


(this has determinant 1), and in fact, this is the only isomorphism class of lattices in Cg. 


* The next smallest value of n is n = 16. There are still no nonzero cusp forms of weight 8 (for k = 4), so we 
know that 
Or (z) = E4(z) =1+ S © 48007(m)q”. 


m>1 
Notably, this is true even though there are two different isomorphism classes of lattices: [1g (as defined in Serre 
chapter 5)? and [fg @Tg. Since the direct sum of lattices gives a squared generating function, this yields the 
surprising identity 
2 
1+ > 48007(m)q” = (: + 2aoasn)@”) 
m2>1 m>1 


For example, looking at the m = 2 coefficient, we have 1 + 480(2’ + 1) = 1 +2 * 240(23 + 1) + 2402. 


« Finally, looking at the case n = 24 finally gives us a nonzero cusp form: the space of modular forms is spanned 


2These lattices are defined by taking a subset of the half-integer lattice points which satisfy certain conditions: for example, the sum of 
the coordinates must be an even integer. 
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by the two functions 


65520 
Es=1+ = Dooulma™, F=q][J—97)*= TI r(a)a". 
m2>1 m>1 m>1 
Thus, we can write our theta function 
6-(z) = E6 + crF 


for some constant cr which depends on the lattice. Computing this constant requires us to count the number 


of points x € T with (x, x) = 2, since computing the q-coefficients yields 


65520 
r(1) = or tT 


For example, F = Tg @Tg @Tg has r-(1) = 720, and [ = Faq has r-(1) = 1104. Of special attention is the 


Leech lattice, which has applications to coding theory and sphere packings: this particular lattice has r-(1) = 0, 


_ 65520 


SOG =" pai 


As a closing remark, needing to restrict ourselves to Type II lattices severely restricts the kinds of lattices we can 
deal with. In particular, forcing that the diagonal terms are even means that we cannot analyze quadratic forms such 
as 


2 2 2 
XT + XQ + Xp. 


to analyze, for instance, the number of ways in which a positive integer can be written as the sum of n squares. In 
order to deal with such cases, we relax our conditions slightly: now (x, x) can be any integer, so defining an analogous 
theta function can be done with respect to the subgroup of G generated by S and 7°. Such functions have “weight 
5” (may not be an integer), and the fundamental domain now has two cusps, yielding two different “Eisenstein series.” 


Further discussion will be postponed to later in this class, though. 


Q9 March 5, 2020 


The LMFDB 


Today, David Roe is here to talk to us about the L-functions and Modular Forms Database. The link to the website 
can be found at https: //www.I|Imfdb.org. 

This project is focused on creating a database of objects in computational number theory — a lot of development 
is happening at MIT and at other parts of the world. It's open-source, and we can use it to learn about some of the 
objects we've been talking about in this class! 

On the left side of the website, there’s a tab with links to various parts of the database. Modular forms, for 
example, are split up into their four types (classical, Maass, Hilbert, and Bianchi). The database also includes elliptic 
curves over Q and over number fields, genus 2 and higher genus curves (calculation gets harder as genus increases), 


and abelian varieties over finite fields. 


Remark 112. The dimension of the abelian variety attached to the elliptic curve (the Jacobian) is actually the genus 


of the curve. We might talk more about this later. 


Other parts of the left column of the website include other objects in algebraic number theory: there’s a tab for 


number fields classified by discriminant (for example, Q[Vd] for d squarefree has discriminant d if d = 1 mod 4 and 
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4d otherwise). And we define invariants similarly for other number fields. There's also a classification of Galois groups 
and Sato-Tate groups (which are related to counting points on elliptic curves mod p). 

We'll take a quick look at the classical modular forms tab. One nice feature is that there are links which 
expand out and explain terms: for example, we can click on the word newform, and we'll get a definition and further 
explanation of the mathematics there. So the purpose of the LMFDB is to provide both a research tool for experienced 
mathematicians and an expository one for those of us who are learning! 

This means that there are two ways to look for a particular modular form: first of all, we can browse by the links 


at the top of the page. 


Fact 113 


We've only been talking about modular forms of full level so far, which means that we want to transform under 


all matrices in SL2(Z). But changing the level means we only look at specific subgroups, which gives us more 


freedom. Modular forms that come up in the proof of Fermat's Last Theorem are of weight 2 and high level. 


Secondly, we can search for something more specific. For example, if we know a specific label for our modular 
form (this means we have a permanent URL), we can type it in to find its homepage. Alternatively, we can search for 
a modular form with specific properties. 

So once we find a modular form, we get to its homepage! Every page starts with some parameters and invariants 
— for example, we are told about the level and weight of the form, its analytic rank (the error counts in the point 
counts for elliptic curves give us a modular form aig + aoq? + a3q? +---, where ap is the error term for the elliptic 
curve under F,. And we can also put those coefficients into an L-function and look at the order of the zero). We're 


given the g-expansion — for weight 1, they often live in cyclotomic fields, so we see them in terms of roots of unity. 


Fact 114 
We can search for modular forms by dimension as well, which tells us about the size of the “block” when we try 


to “diagonalize’ the space of modular forms over Q instead of C. (Each block corresponds to a factor of the 


characteristic polynomial over Q, and newforms correspond to these blocks.) So a lot of calculations end up being 


linear algebra here! 


It's also good to look at the elliptic curves part of the database, which has also classified in a few nice ways. The 
discriminant is something we define directly in terms of the equation of the curve, and we also define the conductor 
(which has a more complicated definition, but it has the same prime divisors as the discriminant). We can think of 
the conductor as the level of the associated modular form! 

The individual elliptic curve homepages are interesting too: they tell us about integral points, group structure, 
certain invariants, and so on. Many pages on this database also have related pages linked on the right, which are 
helpful if we want to learn more. 

So how is this database generated? Each section has pages talking about completeness, reliability, and source 


of the data (for example, there might be assumptions like the generalized Riemann hypothesis). 
Writing topics 


Professor Kim has updated the Stellar page with sample papers and sample abstracts (from last year’s Seminar in 


Number Theory). 
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The most important (and maybe most difficult) part of writing a survey paper is to choose a topic, so that’s 
what we'll brainstorm now! Let's (for example) think about modular forms. We need to write an abstract, which is 
supposed to (briefly) summarize the main content and results of our paper. For example, the abstract might say that 
“we will define modular forms and their properties, and we'll prove the dimension formula.” It's good to say that “this 
is a survey/expository paper based on (books/papers),” since we won't really be doing original research. 

Most papers start with an introduction, which is basically an expanded summary of the content of our paper. 
But if there is some history or background for the subject, we can also talk about that to give some motivation for 
the mathematics, and we can also state a main theorem and the ingredients for its proof (if they exist). One main 
purpose for the introduction is to state the structure of the paper! For example, we might say that “in section 2, 


we do (something), and in section 3, we cover (other things),” summarizing each section. 


Remark 115. When we write an introduction, we might need to introduce new terminology which is not defined yet. 
In principle, we want to define everything before we use it, but an introduction should not have too many detours. So 


it's often good to give label numbers to where definitions are actually made! 


Let’s think about how we might structure a paper about modular forms, for example. We want to start with a 
definition — a modular form is a function satisfying a holomorphic condition, as well as invariance with respect to the 
action of SL3(Z) — but to have this definition, we need to define things like HI, the relations of SL2(Z), how SL>(Z) 
acts on the H, and so on. 

Once the definition is established, a good next step is to give some examples (Eisenstein function, discriminant, 
proving that both of these are indeed modular forms), and then start proving some properties about them. The specific 
sample paper we can see on Stellar finished by proving the dimension formula, and it also had an appendix with results 


from complex analysis (so that they aren't too distracting). 


Fact 116 
We should be careful about plagiarism in our papers — we should make sure to quote and cite our sources! We 


can label sources by numbers or by initials: for example, a book by Diamond and Shurman written in 2005 can 


be labeled as [DSO5] or as [1]. But if we're using a source, we should cite the specific theorem or proof from the 


book so that it’s easy to look up. 


Remark 117. We're writing this paper for an audience like our classmates — basically, we have some background in 


modular forms. 


10 March 10, 2020 


Diamond and Shurman 1.2 — David Wu 


We'll start off the new book by talking about congruence subgroups. First, a quick review of notation and general- 


a b 
ization: let y € ; be an element of SL2(Z). We previously defined what it means for a function to be weakly 
Cc 


modular with respect to the full modular group G: basically, f is weakly modular of weight k with respect to SL5(Z) if 
F(y(7)) = (cr + d)* F(T) 


for all y € SLo(Z),7 € H. 
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But now we'll make a generalization: we'll replace SL2(Z) with a congruence subgroup [, which will allow weights 
to be any nonnegative integer k. 
What is causing this generalization? We know that —/ is an element of SL>(Z), and the group action —/(T) = T 


fixes complex numbers 7 € H. So previously, when k was odd, the definition of a modular function would force 
f(r) = (-1)*f(7), 


which means that f is trivial. The main idea here is that we won't need to include —/ in our congruence subgroup! 


At the end of the last lecture, we were talking about theta functions, and we introduced (for a lattice I) 
r(m) = #{x ET: (x, x) = 2m}. 


We required gnarly conditions on T last time to make sure we had a modular form when we assembled the generating 


function 6. In particular, we (sort of) showed last time that 


6, (-:) = (iz)"/*6r(z). 


Let’s define this more rigorously now: 


Example 118 
Let r(n, k) be the number of lattice points v € Z* such that 


This defines a theta function 6(7, k) = )>?°., r(n, k)q” (where q = e277). 


This is mostly as a sketch — we won't show that this actually converges as a power series or anything like that 
— but the reason this function is important is that we have Legendre’s four squares theorem, which tells us that 
any nonnegative integer can be written as the sum of four squares. This can be rephrased as saying that 0(7, 4) has 
all nonzero coefficients in its g-expansion: once we develop a bit more theory, we'll be able to make progress on this 


question! 
Claim 119. 6(7, 4) is a modular form of weight 2 with respect to a specific congruence subgroup I. 


Proof sketch. First, we can notice that if + / = k, then we have the convolution 


r(n,k)= S$ r(é,ir(m,J). 


é+m=n 


Basically, if we want to write n as a sum of k squares, we go through all possible cases of what the first / squares add 


up to. And this also relates nicely to our generating function: we have 
O(T, ky)O(T, ko) = O(T, ky + ko). 


Since q = e?7'7, our function @ is Z-periodic. This means that we already have (assuming absolute convergence for 


(|p i v4) =6(T+1)+ (7,4) 
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now) 


for all 6, so the action under T is “correct.” For the action under S, our goal is to show that 0 (-2,4) is related 
to 6(7,4). Define the function @ = 6(7,1); then we know that 6% = @(7, 4), so it’s good enough to figure out the 
behavior of @. @ is the number of ways to write a number as a single perfect square, so @ = )o ye e2nid’r (possibly 


with an extra constant factor), and then Poisson summation gives us 


1 
6 (-=) = V-2iT0(T). 
Ar 
. ' : ‘ _ {oO -1 ; . 
Notice, though, that this transformation gives us the matrix fe Gul which does not have determinant 1. Instead, 


el] bp alle | 


and this gives us (with some algebra) 


we'll have to use 


rT 
4r+1 


which maps T to 


6 (4 :) = /47 + 10(rT). 


This implies (raising everything to the fourth power) that 


9 G4) = (47 +.1)76(r, 4). 


So now we've verified the modular form equation 
6(y(T), 4) = (ct + d)?0(7, 4) 


1 0 
under action of a 4 and ; le Before, we cared about being weakly modular with respect to S and T, and now 


we'll look at being weakly modular with respect to these new matrices. But here we're describing things with respect 


to the actual generators, and we want something more general: 


Definition 120 


Let N be a positive integer. The principal congruence subgroup of level WN is 


r(N) =A ; € SLo(Z): [ ; = ; : mod uh 


where congruence is taken entry-wise. 


As a check, we can make sure ['(N) is actually a subgroup of SL2(Z) — this can be done just by running through 
the axioms. Notice that [(1) = SL>2(Z), and also that [(N) has finite index in SL5(Z). Later, we'll state a result 


that helps us explicitly calculate this index, but first we'll finally make our important definitions: 


Definition 121 
A subgroup [ Cc SL>(Z) is a congruence subgroup if [(N) CT for some N. Then is called a congruence 


subgroup of level NV. 
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Definition 122 


Some special subgroups: 


hon [ ; € SL(Z): ? 


r,(N) -4[ ; € SLo(Z): ? ; 


where * can be any element. 


This may seem a bit unmotivated, but we'll see why they're useful later. It’s possible to find generators explicitly 
for [(N),Fo(N), and [,(N), but it’s a bit annoying. 


Proposition 123 
Let N be a positive integer. Then the following results hold: 


1. P(N) CPi(N) CTo(N) C SLo(Z). 
2. The map SL2o(Z) > SL2(Z/nZ) (the natural map) is a surjection with kernel [(N), so SLo(Z)/T(N) is 
isomorphic to SLo(Z/nZ). 
b 
. Consider the map ['1(Z) > Z/nZ, sending ? ; to b mod N. Then this map is a surjection with kernel 
Cc 


T(N), so [4(N)/T(N) is isomorphic to Z/nZ. 


b 
. Consider the map [1(Z) — (Z/nZ)*, sending a ; to d mod N. Then this map is a surjection with 
iG 


kernel [1(N), so To(N)/I1(N) is isomorphic to (Z/nZ)*. 


The main point is that [(N), [o(N), [1(V) are not super mysterious: we can find explicit correspondences between 


them. We'll just do a quick sketch of the second point: 


1 0 
Proof sketch of (2). The identity element of SL2(Z) is ; if so T(N) is indeed the kernel here by definition. 


Showing surjectivity just requires us to lift SLo(Z/nZ) to SL2(Z), and the Euclidean algorithm tells us that this can 


be done. 


This isomorphism allows us to find explicit formulas for the index of these subgroups, and this tells us that [9 and 


[1 have finite index (which can be explicitly calculated as well). 
Definition 124 
a b 
Let the factor of automorphy /(7,7) = ct + d for a matrix y = | ; . Also, let the weight-k operator [y]x 
G 


be defined as 


(Flale)(t) =i, 7) “F(¥(7)). 


We find after unpacking definitions that this just gives us a nice way to say that f is weakly modular of weight k if 
fly]x = f. We can also translate properties (like those about group actions) that we previously proved into operator 


notation: 
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Lemma 125 
We have, for all 7 € H, 7,7’ € SL2(Z), 


Ys i CY CYT) 
(y7’)(7) = ¥(¥'(7)). 


lyy']k = [y«ly’]« (as operators). 


Im 


Im(y,7) = Ga rR 


perdi e 
IOGR)s 


This is making sure that the action under yy’ is the same as the action under 7’ followed by the action under ¥. 
The same proofs work — we're just using operator notation, and the idea is that there’s a more intrisic view of these 
operators independent of the modular form f. One nicer way to show these that we haven't discussed yet, though, is 


to define a group action of SL2(Z) on column vectors by matrix multiplication. Then we find that 


Y Hl = Po IC), 


and this allows us to show the identities easily. 
The point is that if f is weakly modular with respect to a set of matrices, it’s also weakly modular with respect to 
the group that is generated by those matrices. It turns out that the theta function 6(7, 4) is weakly modular of weight 


2 with respect to the subgroup [9(4). 


Diamond and Shurman 1.2 (continued) — Dhruv Rohatgi 


We'll now formally define modular forms with respect to the congruence subgroup and apply this to the four square 


theorem. 


Definition 126 
Let M;(I) be the vector space of modular forms of weight k over a congruence subgroup I. A function f : H+ C 


is weakly modular of weight k with respect to I if 


1. f is weakly modular of weight k with respect to [, 


2. f is holomorphic on H, 


3. f is holomorphic at the cusps. 


Let's define what this last point actually means by thinking about what happened when T = SL2(Z). In that case, 
we knew that f(z) = f(z +1), so f was Z-periodic, and there was a transform f : D \ {0} > C defined by 


F(z) = Fle") = F(a), 


Because f was holomorphic on the upper half-plane, f was holomorphic on the punctured unit disk, so there was a 


Laurent expansion in q: our condition forced us to say that there were no negative coefficients there. 


, 1 N ; ; . 
Now, in our general case, we know that [ > [(N) 5 ae at where this last matrix represents translation by N. 
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So a function that is weakly modular with respect to [ satisfies f(z) = f(z +N) for some N, and now we can define 
f(z) = Cada? 


where we'll define g = qy = e27'2/N_ As before, we have a Laurent expansion at g, and we'll say that a function f 
which just satisfies (1) and (2) (that is, weakly modular of weight & with respect to [, and holomorphic at HI) is also 
holomorphic at co if f has a holomorphic extension at q = 0. 

But this isn’t strong enough for a general subgroup [: we need that the vector spaces of modular forms of a 
fixed weight is finite dimensional. In the SL2(Z) case, we can map any rational point on the real line to co with a 


transformation in SL5(Z), but this isn’t quite so nice in the general case: 


Definition 127 


Let [ be a congruence subgroup. Then a cusp of I is an equivalence class of QU {oo} under action by [. (Because 


l has finite index, there are only a finite number of cusps.) 


Definition 128 
A function H — C satisfying (1) and (2) above is holomorphic at the cusps if f(z) is holomorphic at oo for all 
7 € SL2(Z). 


(Notably, this maps any cusp to infinity.) At first glance, this seems a bit hard to work with: if we're given a 
q-expansion, it’s easy to say that it’s holomorphic at oo, but not so much with other cusps. Fortunately, we have an 


easier condition to check: 


Proposition 129 
Let f : H > C be holomorphic (on HI) and weakly modular with respect to a congruence subgroup T of weight 


N, and suppose f is holomorphic at oo with coefficients of g-expansion 


f(z) = Se anq” 
n=0 


satisfying a, = O(n’) for some constant r (polynomially bounded), then f is modular form with respect to T (it 


is holomorphic at the cusps). 


Proof sketch. Doing some calculations shows that if z = x + /y, we have 


Cy 


lf(Z)l<e+ val 


by doing some integration. Now if we let a € SLo(Z), it suffices to show that q- f(az) converges to 0 as q > 0. 
Then q- f(az) extends to a holomorphic function with a zero, so we can divide out the q again. To do this, we use 


the above bound: fixing q and taking g = e27’2/" for x € [0, N] (without loss of generality), 


Imz 


Im(az) — Jez + d2’ 


and using the bound on x and Im z, this is at least Wey = So the magnitude of |f(az)| is at most O(y't+), but 


q is exponentially decaying, so we're done. 


We'll return to the motivating example from David's lecture: 
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Proposition 130 


64(z) = 0(z, 4) is a modular form in. M>((4)). 


Proof. We showed in the previous lecture that @4(z) is weakly modular. To show that this is holomorphic, we know 
that 


co 


64(z) = S- r(n,4)q". 


n=0 
On any compact subset of HI, g is bounded by 1, and r(n, 4) is bounded polynomially, so this will converge absolutely. 


And because |r(n,4)| = O(n*), we can use the previous proposition to show that 64 is holomorphic at the cusps as 


well. 


We'll now look at some examples that help us characterize our vector space of modular forms. Now that we can 


have modular forms of any integer weight, we now write the weight 2 Eisenstein series as 


622) = OO pape = 2612) - 8? Doula” 


neZ meZ n=1 


This isn’t a modular form in any reasonable sense, but here’s how we can use it: 


Proposition 131 


We have the functional equation 


Go(yz) = (ez + d)*Go(z) — 2mic(cz + d) 


a b 
for any y = i in SLo(Z). Thus, for any positive integer n, we can define 
G 


Go,y(Z) = Go(z) — NG2(Nz), 


and this will be a modular form of weight 2 over [9(N). 


Proof sketch. We saw in a previous lecture (with some difficulty) that 
1 5 
G, a) eee Go(z) — 2m iz, 


and also that 
Go(z + 1) = Go(z). 


Thus, the functional equation is satisfied for -y = S,S~!,T, —/ (if we just plug in the correct values of c and d for each). 
Suppose that Go(71Z) = (cxz+d)*Go(z)—27/c(C,z+d,), and also that Go(Y2z) = (@zZ+d)*Go(z)—2T/co(Gz+d2). 
Then we can check by direct computation that Go(71¥2Z) also satisfies the desired equation, and since S,S~1, T, —! 
generate SL5(Z), we've shown the functional equation for all -y. 

Now if we define Go, as above, we can show that this is weakly modular with respect to Fo(N) by direct compu- 
tation. To show holomorphicity, we do something similar as with @4: since the coefficients of Go(z) and NG2(Nz) are 


polynomially bounded and we're subtracting two such terms, their difference is also polynomially bounded. Using the 


proposition above then shows that Go (Zz) is a modular form. 
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Example 132 


Consider N = 2 in the above calculation, giving us the series Go o(N). 


Then 
G2,2(Z) = Go(z) — 2G2(2z), 


and looking at the g-expansion shows that 
Go,9(z) = —2¢(2) — 8x* S© (a1(n)q” — 201(n)q?") = —2¢(2) — 8x? S © (o4(n) — 201(n/2)) @”, 
n=1 n=1 


where 0; is zero if its argument is not an argument. Notice that 20,(n/2) is actually the sum of all even divisors of 


n, sO we can write 


—2¢(2)- 8m? S57] So |g’. 


n=1 \d\n,24d 


This is a modular form with respect to [9(2). 


Example 133 


Take N = 4, giving us the series Go.4(N). 


This, similarly, shows that 


Ga4(z) = =7° = an?) S- d|q" 


n=1 \d|n,4td 
This is a modular form with respect to [9(4). Since Go,o(z) is a modular form with respect to [9(2), it’s also a modular 
form with respect to [9(4). So now we have three modular forms with respect to [o(4), but it turns out (we'll learn 
in May, perhaps) that the dimension of Mo(lo(4)) = 2. This means that 64 is in the span of Goo and Go (because 


those two are linearly independent), and we also know the first few coefficients 
This is enough to tell us what linear combination to take: we actually just want 
1 
04(Z) = ~ 73 G2.4. 


So we know what the g-expansion looks like: we have 


ax(z)=148)> yea 


n=1 \d|n,Atd 


And the coefficients here are always nonzero, because 1 always divides d, and we've proven Lagrange’s four squares 


theorem! 


11 March 12, 2020 


The two speakers (Kaarel and Anton) for today aren't coming due to sickness, so we'll have to change the schedule a 


bit. (They'll present after spring break.) 
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The deadline for our paper abstract will stay the same, but we will have an extension. (We should make sure we 
do submit it by the end of break, though.) 
Office hours, as well as class, will be done by Zoom. We'll be livestreaming student lectures, so the time zone for 


students is important. We can either use a tablet or use slides / Beamer. 


12 March 31, 2020 


Diamond and Shurman 1.3 — Kaarel Haenni 


We'll start by recalling a lemma from a previous lecture: two bases give the same lattice if and only if we have a 


basechange matrix y € SL2(Z) that gets us from one to the other. 


Definition 134 


A complex torus is a set C/A of additive cosets {z +I : z € C} with algebraic structure as a quotient of C: 


addition is given by addition in C, and geometric structure is also induced by C. 


This turns out to make C/A a Riemann surface — we won't define this rigorously, but we can just think of this as 
things looking “locally like C.” A good way to think about C/A is that we have a fundamental domain of the lattice A, 
except we identify opposite edges. Topologically, there are neighborhoods that look like normal neighborhoods in C, 
but there are also neighborhoods that “cross over the boundary:” quotienting one pair of edges gives us a tube, and 
then quotienting the other pair gives us a torus. 


We might have seen this next result from regular complex analysis before: 


Theorem 135 (Open mapping theorem for Riemann surfaces) 
Let X and Y be two Riemann surfaces, and let f : X — Y be a holomorphic map. Then either f is constant or f 


is an open mapping (it maps open subsets to open subsets). 


(Here, holomorphic means that the map looks holomorphic on neighborhoods that “look like” C.) 


Proof sketch. Restrict the map to neighborhoods and then apply the usual open mapping theorem; now take a union 


over the whole surface. 


Corollary 136 


Let X and Y be compact Riemann surfaces, and say that f : X — Y is holomorphic. Then f is either compact 


or a surjection. 


Proof. A theorem of topology says that if X is compact, then f(X) is compact, and therefore f(X) is closed. But X 


is also the whole space, so X is also open, meaning f(X) is open by the open mapping theorem. Thus f(X) is closed 


and open, and Y is connected — therefore Y = f(x) LI f(x) means that f(X) must be the whole space. 


Now we'll look at a specific set of maps that we care about: in particular, everything from before also applies 


because complex tori are compact Riemann surfaces. 
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Proposition 137 
Suppose @: C/A > C/N is holomorphic. Then there exist m,b € C such that mA Cc A’, and the map has the 


explicit form $(z + A) = mz+ b+A’, where m and b are unique mod A’. Additionally, this map is bijective if and 
only if mMA=N. 


Proof sketch. A lifting theorem from topology means that we can lift @ to a map between the universal covering 
spaces. The universal covering space of a complex torus is C, so we have a map ¢ from C > C which satisfies the 


following commutative diagram: 


go? 4 


[ms [rw 


CHA == 5G" 


Now for any 4 € I, consider the function (A(z) = 6(z +A) — $(z). Since the diagram commutes, fy must map to NV’ 
((z +A) and z need to end up in the same element of A’, because they project to the same thing in our commutative 
diagram). But A’ is a discrete subset of C, so any continuous map into A’ has to be constant. Thus its derivative is 
zero, which means that $/(z + A) = 6(z)/. But now the derivative of ¢ is holomorphic and it’s also A-periodic — this 
means that it is bounded, and therefore it is constant by Liouville’s theorem. This means that $(z) = mz+ b, as 
desired, and then finding @ from @ is diagram chasing with the above commutative diagram. 


And this is an if and only if statement: any map of this form is holomorphic, as long as mA C NV’. Explicitly finding 


two elements that map to the same element (showing that it’s not injective) gives us the last part of this theorem. 


We're considering holomorphic maps, which preserves the geometric structure, but we also want to preserve the 


algebraic structure: 


Corollary 138 
Let 6: C/A — C/N be holomorphic, or equivalently ¢(z + A) = mz+ b+ A’ with mA CN’. Then the following 


are equivalent: 


* @ IS a group homomorphism. 
* bE WN’, which means that $(z + A) = mz+WN. 


- ¢(0) =0. 


This is not too hard to show — we just plug in the properties of the group homomorphism. In particular, this tells 
us that there exist a nonzero holomorphism from /CCA to C/A’ if and only if there is a nonzero m € C such that 


mA C AN’ (and we have an isomorphism if there is equality). 


Example 139 
Consider a lattice A = w,Z ® WoZ with the usual normalization T = ce € H. Let A, = TZ © Z, and consider the 
map 

ob, : C/A > C/A, 


given by @-(Z + A) = Z/wo +A. 
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We can notice that this indeed maps C/A to C/A, — this is an isomorphism of complex tori, and in fact this sends 
our complex tori (up to SL2(Z)) to some T € H. To be more precise, tori are isomorphic if they are sent to the same 


orbit — the isomorphism classes of complex tori are in bijection with orbits of SLo(Z) of the upper half plane. 


Definition 140 


An isogeny is a nonzero holomorphic homomorphism between complex tori. 


We have the following properties: 


* Holomorphic isomorphisms are isogenies. 
« Isogenies surject. 


+ Isogenies have finite kernel. (This is because complex analysis tells us that it has a discrete kernel, and then we 


have a discrete set in a compact space.) 


There are two main examples of isogenies that we care about: these might not be all of the isogenies, and this has 


to do with complex multiplication. 


Example 141 


The multiply-by-N map [N] : C/A > C/A sends a point z+ Ato Nz+A. 


The kernel here is important: it’s the set of points z +A such that Nz € A, and we call this the set of N-torsion 


points. Letting E = C/A, we notice that we have a group structure: 
E[N] = Z/NZ x Z/nZ. 
If we draw a picture, this basically means we split up our complex torus into an N x WN grid. 


Example 142 
Let C C E[N] be a cyclic group of order N in the N-torsion subgroup. Then the preimages of C in the map 
C + C/A form a superlattice of A, which we also call C — this is because we can pick an explicit basis (W 1, W2) 


and also of the generators of C. 


Then we get a map m: C/A > C/C, where z + A is sent to z+ C (this is a projection map). We can convince 
ourselves that this is a holomorphic homomorphism, and its kernel is C. Pictorally, the easiest example of this is to 
take the subgroup generated by “7: then C/C looks like a slice of the previous fundamental domain, and then we just 


project down. 


Proposition 143 
Any isogeny @: C/A > C/N’ can be factored as 


o:c/ASC/ASC/nkK SC/N, 


where K is the kernel of @. 


Basically we multiply by n, then do a cyclic quotient, and then we do an isomorphism. (Proof omitted for now.) 
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Proposition 144 


C/A is isogenous to C/A’ if there exists an isogeny between the two. This is an equivalence relation. 


Proof. Being reflexive and transitive are both easy — symmetry is a bit trickier, but the idea is that there is a dual 


isogeny that we can explicitly construct. 


This dual isogeny has a few properties: 
¢ The multiply-by-N map's dual is itself. 


+ The dual of a cyclic quotient is the dual of a different cyclic quotient of the same order (the “orthogonal one”) 


and then scaling up by N. 
« The dual of an isomorphism is its inverse. 
« The dual of the isomorphisms is the isomorphisms of the dual, but in the opposite oder (just like functions). 


¢ The dual of the dual is itself. 


* The sum of duals is the dual of the sum. 


For the purpose that we’re using these for, which is studying modular forms, it turns out that isogeny is a better 
equivalence relation than isomorphism. 
The last topic of this section is the Weil pairing, but we don’t have too much time to talk about it: recall that 


E[N] = (wi/N +A) x (wo/N +A), 


and let iy be the group of Nth roots of unity. We can define an inner product ey : E[N] x E[N] > Ly as follows: 


given P,Q € E[N], we can assemble a coefficient matrix y such that 


wi/N+A 
Wo/N+A]- 


We then define 
en (P, Q) = erridety/N 


and we can check that this is actually independent of our choice of basis for A — this is because determinants are 


basically the ratio of areas for fundamental cells. 


Diamond and Shurman 1.4 — Anton Trygub 


Today, we’re going to talk about the connection between elliptic curves and complex tori. We'll start with any lattice 


A and consider the Weierstrass p function 
1 1 1 
oonmat © leew) 
weA—{0} 


for all z € C not in the lattice A. (We'll show later on that this is an absolutely convergent sum.) Notice that that 


the derivative of this function is 


F 1 
gp (2) =-2)° wy 


wen 
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and now if we consider any w € A, we can consider the function 


F(Z) = @(2 + w) — @(Z). 


Then the derivative is 
f(z) = p'(z+w) — @'(z) =0, 


because @’ is periodic, so f(z) is constant. And then we can calculate the exact constant by substituting in —$: then 


(-3) =#() -#(-3) = 


because g is an even function. Therefore g(z + w) = g(z) for all w € A, and g is periodic. 


Definition 145 


The Eisenstein series for a lattice A are defined as 


GK(A= >> 


weEA—{0} 


We're going to make a connection between elliptic curves with complex tori now through the next few results: 


Proposition 146 


The Laurent expansion of (Z) is 


pl)=St DY (m+ 1)Go42(A)z" 


n>2, even 


for all 0 < |z| < inf{|w| :w EA -— {O}}. 


Proof. We can rewrite 
1 1 1 1 1 2 
= 1)=5(14+54+(=) +--)?-1 
(z—w)? w2 w? \(1—Zz/w)? Ww? W Ww 
and then expanding this out yields 
2 
— (2 13-44 (=) 4 oy). 
WwW W W 
| 


Taking p = SUPyer—{o} id (which is less than 1 by assumption), this sum is bounded absolutely by 


S| (2 t 3p + 4p? +--+) <oo. 


Now summing over w, since >> wl was shown to be absolutely convergent, we do indeed have absolute convergence 
of this series. 


So we can now rearrange terms to find that 
i _ Zz 
p= Del 
weA—{0} n=1 


and the terms with odd n cancel because the ws in the denominator have opposite signs. This yields the desired result 


by evaluating the inner sum to be Gpyo(A), and now we've shown that g(Z) is well-defined as well. 
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Proposition 147 


We have the relation 
('(z))* = 4@(z)° — g2(A)@(z) — 93(A), 


where go(A) = 60Ga(A) and g3(A) = 140G¢6(A). 


Proof. From the above proposition, we know that 
1 2 4 6 
g(z) = 2 + 3G4(A)z* + 5G6(A)z" + O(2”), 


and also that 5 
g'(z) = a 6G4(A)z + 20G6(A)z? + O(z°). 


By expansion, we see that ~o'(z)? and 4@(z)? — go(A)@(z) — g3(A) are both 4 — 4&4 _ goG_(A) + O(z?). Thus 


their difference is holomorphic, but it is also A-periodic. Thus it is bounded and constant, and taking z + 0 makes 


this difference go to 0, and thus the difference is zero. 


Proposition 148 
Let A be a lattice generated by (w1,w2), and say that w3 = Ww, + We. Then the cubic equation satisfied by o(z) 
and @'(z) above, y? = 4x? — go(A)x — g3(A), can be rewritten as 


2 — A(x — e,)(x — e)(x — 3), 


where e = 9 (%) are distinct. 


Proof. The function f(z) = g(z)—t is meromorphic for any t, so there is some translation P of the basis parallelogram 
such that there are no poles or zeros on that boundary. This boundary also has opposite sides being equal, so that 


tells us that the contour integral 
1 F(z) | 


2ni ap f(z) 


Therefore, the number of poles and zeros inside of OP is equal. The only pole is the point inside P in the lattice A, 


and that has order 2, so there are two zeros of f inside P. 
But @’ is an odd function, so w’ (4) = w’ (—¥) for any w € A. So then we can take the e; = @ (4) that we defined 
above, and they will indeed be roots of 4x? — go(A) — g3(A) (because those are exactly the points where @'(z)* = 0). 
wi 


And these roots are pairwise different because 5 is a double root of g(z) — ¢; (its derivative is zero at that point too). 


Since we're only allowed two zeros in P, there is no other value of z in P such that g(z) = ej. And this logic works 


for every point in g(z). 


Corollary 149 


The function A is nonvanishing on H. 


Proof. For any tT € H, we can consider the corresponding lattice A, and the cubic 4x? — gp(A,)x — 93(A,). This 


g(t)> _ 2793(T)? _ A(t) 
16 


a ig and this is nonzero because the roots are distinct by the previous proposition. 


discriminant is 


Thus A is nonzero. 
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So now we know that the map z > (ga(z), @a(Z)’) takes nonlattice points of C to points on the elliptic curve 
y? = 4x? — go(A)x — g3(A). Indeed, for any value of x € C, we have two values of y that satisfy the equation 
corresponding to the points (x, y) and (x, —y) respectively. We can also extend this map by defining a point at infinity 
for the elliptic curve, and this means that we have a bijection between each complex torus C/A and each elliptic curve 
of the form y? = 4x? — go(A)x — g3(A). 

Call this map (g, @). (g, @') actually transforms the group law from the complex torus onto the elliptic curve. 
If we have two points z; +A, Z2-+A on the torus, then (9(21), @'(Z1)) and (g(Z2), p'(Z)) form a secant (or tangent) 


line in C? of the form ax + by +c =0. Consider the meromorphic function 
f(z) = ap(z) + be'(z) +c. 


We'll use the argument principle again. When b ¥ 0, this has a triple pole at O and zeros at z,; + A, Z +A. We must 
have three zeros in this case, and the third zero must be at the point where z, + Z +Z3 = 0. Meanwhile, when b = 0, 
f has a double pole at O but still zeros at z, + A and Z +A, so we can say that z, + Z +z; = 0 in C/A as well 
— this corresponds to the point at infinity (@(0), @’(0). The whole point is that the points on the elliptic curve that 
also satisfy ax + by +c = 0 are exactly (f(z), @'(z)), where we've just defined z3, and thus collinear triples on the 
elliptic curve sum to zero in the complex torus. 

We've described a way to go from a complex torus to an elliptic curve, and it turns out we can go in reverse as 


well: 


Theorem 150 


For any elliptic curve y? = 4x? — aox — a3 where a3 — 27a3 # 0, there exists a lattice A such that a2 = go(A) and 


a3 = g3(A). 


Proof. We have the surjective map / : Hl — C, so for any a2 and a3, there exists a 7 such that 


172823 go(T)? a 


(T) = ; 
jr) a3 — 27 a3 go(T)3 — 27g3(T)? a3 — 27.a3 


When a2 and a3 # 0, we can take the reciprocal of both sides and subtract 1 to find that 


2793(T)? 27 a3 
go(T)? a3 


This is an invariant quantity when we scale a lattice by w: in fact, g3(A) = 93(T)w~°. 


Pick w accordingly, and then 
93(A) = a3 and go(A) = ao as desired. 


In the edge cases, if a2 = 0, then go(T) = 0, and we can just scale w again to get g — 3 to the correct value 


without worrying about go. The same thing works If a3 = 0. 


The important takeaway here is that complex tori and elliptic curves are interchangeable! 


13. April 2, 2020 


Diamond and Shurman 1.5 — Swapnil Garg 


The topic for this section is moduli spaces and modular curves, but we'll only be talking about specific examples of 


the former rather than general theory. Recall that the Weierstrass g function gives us a bijection between complex 
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tori (equivalent to fundamental domains of lattices in C) and complex elliptic curves, and the mapping is also a group 
isomorphism (with addition corresponding to collinearity). We also know that there is a holomorphic group isomorphism 
(meaning that the two complex tori are isomorphic) if and only if A’ = mA for some m € C: this means one is a 
complex multiple of the other. 

The purpose of this lecture is to show that complex elliptic curves are in bijection with orbits of the action of 
SL>(Z) on the upper half-plane. We'll define specific kinds of moduli spaces here — these three are the only ones that 
we're going to be using, so we won't define a moduli space completely. Recall that for any integer N, E[N] is the 
N-torsion subgroup of E, which is the elements in the complex torus with Ng = 0. We know that |E[N]| = N? (we 
have an N by N grid), and the Weil pairing is defined as 


en(P, Q) = ernidety/N 


where det ¥y is the size of the fundamental domain of the lattice generated by P and Q, relative to the size of [/N. A 


few weeks ago, we defined some congruence subgroups of SL3(Z): 
ok ok 
Po(N) = {re stata y= ; | mod uh 
* 
1 * 
n(n) = {1 Sate) = ; mod uh 


T(N) = {" € SL(Z): y= E mod v| ' 


We'll connect these definitions with our elliptic curves as follows: 


Definition 151 


An enhanced elliptic curve for [o(N) is an ordered pair (E, C), where E is a complex elliptic curve and C is an 


order N cyclic subgroup of E. An enhanced elliptic curve for [4(N) is an ordered pair (E,Q), where E is a 
complex elliptic curve and Q is a point on E of order N. An enhanced elliptic curve for [(N) is an ordered pair 
(E, (P, Q)), where E is a complex elliptic curve and (P, Q) generate E[N] with the Weil pairing ey(P, Q) = e27//%. 


Notably, we can generate an enhanced elliptic curve for Fo(N) from one for F4(N) by looking at the cyclic group 
generated by Q. For all three definitions, we define an equivalence relation such that (E, x) and (E, x’) are equivalent 


if there is an isomorphism that sends E — E’ and x > x’ — we'll denote the equivalence classes as [E, x]. 


Definition 152 
Modding out the sets of enhanced elliptic curves for [o(N),1(N), and F(N) by the equivalence relation yield 
So(N), Si(N), S(N) respectively, which are examples of moduli spaces. 


When N = 1, all three of these are just the space of complex elliptic curves with equivalence by scaling (that we 


introduced earlier), since the torsion group is trivial. 


Definition 153 


The modular curve Y(I) for a congruence subgroup [ of SL>(Z) is the quotient space of orbits of H under the 


action of T, which can also be written as [\ HI. Denote Yo(N), Y:(N), Y(N) to be the modular curves for To, Fy, F 


respectively. 
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Recall that A; denotes the lattice TZ @ Z for any tT € H. Such lattices correspond to elliptic curves E,, and this 


iS unique up to adding an integer to T. 


Theorem 154 


We have the following descriptions for the moduli spaces introduced: 


1. The moduli space of Fo(/V) is 
So(N) = {[Er, (1/N + A,z)] : 7 € Hi}. 


2. The moduli space of [4(N) ts 


So(N) = {[Er, 1/N + Aq] 7 € H}. 


3. The moduli space of F(N) is 


S(N) = {[E,, (7T/N+A,,1/N+A/r)] : 7 € H}. 


Basically, all of the order-N subgroups are isomorphic to taking the point 1/N (which generates a specific subgroup 
in the lattice). Here, Fi + A, denotes an additive coset of C, and indeed N times that point is an element of the 


lattice. 


Proof. Note that E is isomorphic to C/A, for some tT’. The three proofs are similar: for (1), if we have an enhanced 
elliptic curve (E, P) in So(N), we can say that the cyclic subgroup P is generated by (ct’ + d)/N+A,/, and then we 
can find a,b such that ad — bc = 1 mod N. In (2), we similarly find a, b for the pount (cr’ + d)/N +A;), and in 
(3), we know that (at’ + b)/N +A, (cr’ + d)/N +A, are the two points that generate E[N]. In all cases, we get a 


b 
matrix : i in SLo(Z/NZ), which lifts to an element y € SL2(Z). Now the element 7 we want is 
Cc 


a aT’ +b 
ct’ +d’ 


because we indeed can write this enhanced elliptic curve with m= ct’ +d and 
mA, = m(1TZ@Z) = (at +b)Z 06 (cr + da)Z=7'1ZOZ=AN,, 


where the third equality comes because we're acting by an element y~* of SL2(Z), and then 


In other words, we’ve found a 7 such that our enhanced elliptic curve reduces to the desired form. We now want to 


show that each equivalence class [E;, (data)] map to equivalent enhanced elliptic curves. This can again be done by 


writing our elements T = yr’ and verifying that multiplying by m= cr’ + d sends one to the other. 


Theorem 155 


In all three cases of the theorem above, we have equivalences [E,, (data)] ~ [E,, data] if and only if [fr = Pr’ 


(so T, 7’ are in the same [-orbit and map to the same element in the modular curve). Thus, we have bijections 
from So(N) > Yo(N), Si(N) — Yi (N), and S(N) > Y(N). 


Proof. We've shown the backwards direction above already. Now if we have equivalent enhanced elliptic curves from 


E, and E,, then mA; = Az. Since (mrt, m) now forms a basis for A;, we can write mt = at’ + band m=crT' +d, 
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a b 
which gives us ‘Y = . We wish to show that ¥ is indeed in the corresponding congruence subgroup IT, which we 


can verify by noting that m(1/N +A,) goes to 1/N+A,. 


We can call these specific representatives enhanced elliptic curves of the special type. For example, (E,,1/N + 
A,) is an enhanced elliptic curve of the special type, but not (F,,-1/N +A,). This theorem indeed tells us that 
Y(1) = SL2(Z)\H, which is what we wanted to show initially (because torsion data doesn’t tell us anything). We 
talked about the j-invariant earlier in this class — we can now associate each complex elliptic curve with the orbit 
SL»5(Z)r, and can define a value /(E) = j(7T) for each curve. It turns out that elliptic curves with rational j-values 
correspond to modular forms — this is connected to Fermat's last theorem! 

Since moduli spaces and modular curves are equivalent, we can now take maps of modular curves and make them 


into maps of moduli spaces. 


Example 156 
We have a natural map from Y;(N) — Yo(N) taking orbits of 7 in [4(N) to orbits in [o(N). This translates into 
a map from S;(N) to So(N), which takes [E, Q] to [E, (Q)]. 


Example 157 
T4(N) is a normal subgroup of To(N), so the quotient acts on Yi(NV). Translating this to our moduli spaces, we 
get an important map +, defined as 


Mi(N)y : [E, Q] + [E, dQ] 


(where d is the bottom-right element of the matrix y). This is a Hecke operator, and it will come up later. 


Recall that modular forms have some weight k by definition: there’s a connection here as well. 


Definition 158 
Let T be one of the groups Fo(N),Fi(N), P(N). A function F : {enhanced elliptic curves for f + C is degree-k 


homogeneous with respect to I if we have 
F(C/mf, mx) = m-*F(C/A, x) 


for any lattice A and complex number m. Here, x is a cyclic group of order N if T is Fo(N), a point of order N if 
it is [4(N), and a pair of points if it is F(N). 


Basically, functions on enhanced elliptic curves give functions on the upper half-plane (as a function of T). 


Definition 159 


The dehomogenized function f : H — C corresponding to a degree-k homogeneous F is 


(Ge) — a (C/iNoo) 


where x is (1/N+A,) if [ =[o(N), 1/N+A, if f =[1(N), and (7/N+A,,1/N+A,) if it is [(N). 
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Proposition 160 


A degree-k dehomogenized function f is weight-k invariant with respect to [. In other words, f(y(7)) = (cT + 


ay rep). 


We can see this by letting m = (ct +d)~! in the definition. 

Recall that lattice functions correspond to functions on the upper half-plane: we're saying here that functions on 
enhanced elliptic curves do the same, and we can get a function F on enhanced elliptic curves from a function f that 
is weight-k invariant with respect to a congruence subgroup. This is well-defined, because two enhanced elliptic curves 


of the special type will have the same values of f, and then we can define 


F(C/Ap, x) = m-*F(C/A,z, x). 


Diamond and Shurman 2.1-2 — Natalie Stewart 


The point of this lecture is to start allowing us to talk about everything from a differential geometry perspective with 
Riemann surfaces. Recall that a congruence subgroup [ C SL5(Z) is a subgroup containing one of the principal 
congruence subgroups F(N) — then the modular curve for F is the orbit space Y(T.) = T HI. But we only really know 


how this looks with respect to the quotient topology, and we're going to upgrade this now. 


Definition 161 


A Riemann surface is a connected complex 1-dimensional manifold — that is, it’s a connected topological space 


with a countable basis which is Hausdorff, and for any two intersecting neighborhoods U,,, U,, we have coordinate 


charts dm, dn > D (the unit disk) such that the following composition is holomorphic: 


Om(Um A Un) > Um A Un > On(Um 1 Un). 


One note about the last point is that we can map U,, and U, conformally to the unit disk D, and we want the 
“transfer map” between the images of the intersection U,,M U, onto the disks to be holomorphic. 

We have a surjective open mapping H — Y(T) (which maps 7 to ['7), which tells us immediately that Y(T) is 
second-countable and connected. To show that Y(I) is a Riemann surface, we need to show that it is Hausdorff, 


providing coordinate charts on Y(T), and we'll show that those transfer maps defined above are holomorphic. 


Proposition 162 
Let 71, T2 be two points in the upper half-plane. Then the action of SL2(Z) (and of I) is properly discontinuous: 


we can find neighborhoods U; and U2 of 71, T2 such that 


¥(U1) U2 AS => (1) = Te. 


This takes a long time to prove, so we won't talk too much about this here. 


Corollary 163 


The modular curve Y(T) is Hausdorff for any congruence subgroup I. 


Proof. Suppose we have two different points 7(71), (72) € Y(T). By definition, 71, To are in different orbits, so we can 


choose two neighborhoods as in the above proposition so that y(U1) M U2 = @, which means that m(U1) N m(U2) = @ 
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(all of the points in the neighborhood of 7, are in different orbits as those in T2). Since 7m is a quotient mapping, it’s 


an open mapping, and thus 2(U;) is a neighborhood of m(7;), which gives us our desired neighborhoods. 


Giving coordinate charts is significantly more difficult, so we'll go through some random-seeming constructions 


which will come together: 


Definition 164 
Let G be a group acting on a space X, and let [ C G be a subgroup. The isotropy subgroup for 7 under [ (for 


any T € X) is the set of y € T which fix T. 


Definition 165 


A point 7 € HH is an elliptic point for [ if the containment {+/}I, D> {+/} is a proper containment. (m(T) is 


also called elliptic.) 


In other words, elliptic points are preserved by some nontrivial transformation of H by T. 


Proposition 166 
Let [ be a congruence subgroup of SL2(Z). Then the isotropy subgroups I; are finite cyclic groups for all elliptic 


points T. 


We'll prove this next lecture — this was also proved in Serre. This allows us to make the following definition: 


Definition 167 
The period of 7 € H, denoted h,, is 


yz =ter 
[oe are 


hy = EDU /{£0] = 


Using a bit of general topology, we'll define this for the modular curve as well: 


Lemma 168 
Let G be a group acting on a space X, and let 1 CG. If T € X is a point, then for any y € G, we have 


a ee = ee ae 


When y € I, this reduces to the period of 7 being constant across orbits — thus, we can associate the period with 
points of Y(T) instead of rT. 


Proof sketch. If we take any a € T,, we know that yay~+y(T) = ya(T) = ¥(7), which shows that the left side is 


contained in the right. The other direction is similar — if yay is an element of the right side, then ya(T) = y(T), so 


a(t) =T. 


We'll now move to something that looks unrelated: we can extend an action of SLo(IR) on H to an action of 
a b az+b 
2 ; 
cfd cz+d 
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GL2(C) on the Riemann sphere: 


Definition 169 


IL 4p 
The straightening map for a point 7 € H is the action of ; € GL2(C). 
=7 


Note that 6,(7) = 0 and 6,(7T) = co, which is nice because we can map neighborhoods to neighborhoods of 0. It 
turns out this is also a straightening in a more powerful sense if we connect this with isotropy groups: since congruence 
subgroups have all real coordinates, [., = I, which helps when we're dealing with the Riemann sphere (and complex 


numbers). Using the above lemma about conjugation, we find an equality of isotropy groups 


(5, ({4I}P)8r)o/{EN} = 6 (LEM r/{EIP) Se" = (6r(LEN}P)Or oo {EN}, 


and we'll denote this group G. (In words, the transformations induced by the isotropy group of the image of [ under 
conjugation of the straightening map at O and oo are the same as the conjugate of the isotropy groups around T.) We 
know that G is given by the properties of the isotropy groups, and for all g € G, we know that 0 and oo are preserved, 
so we must actually have g(z) = az. In addition, because G is finite cyclic, each element of G is just a rotation around 
the origin by some integer multiple of =, since |G| = hr. 

This helps us think about how to define our coordinate charts using our straightening maps: under the map 6,, 
neighborhoods #(U) turn into radial sectors of the neighborhood of 0. Then we can apply the h,-fold “wrapping map,” 
Pn,(Z) =z", to get the whole neighborhood. 

So we need to decide what our small neighborhood U looks like, and we'll use the following corollary of proper 
discontinuity of SL>(Z) action: 


Corollary 170 
Let [ be a congruence subgroup of SL2(Z). Then 7 € H has a neighborhood U such that 


VWU)NUAS = yells. 


This neighborhood has no elliptic points except possibly T. 


Now, we can pick a point m(T) € Y(T), and we'll let U be the neighborhood of 7 from the above corollary. We'll 
introduce the notation 6 = 6,,9 = Pp, (for the straightening map and wrapping map, respectively), ~% = p06, and 
V=WvU). 

We claim there is a bijection ¢ : m(U) — V such that 6o ma = wW. This is essentially an algebraic manipulation: 
(71) = 1(T2) if and only if the two points are in the same orbit, which is true if and only if 7, is in [-72. This means 
that 

5(t1) € 6(Fr72) => 4(71) € (60 5~*)(5(72)), 


which means that the two elements have the same order — 6(7,)" = 6(72)", meaning that W(7,) = W(t2). 


Lemma 171 


@ is a homeomorphism. 


Proof. This is a verification that the map and its inverse are continuous: we know that w and 7 are both continuous 


open surjections, an open subset W of V will correspond to preimages that are open. 
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We now need to show that @ Is holomorphic, and this is a lot harder. We'll define 
V4.2 = b1(1(U1) N 1(U2)) 
(the small subset of the disks corresponding to the intersection of the neighborhoods), and define V2 similarly. 


Proposition 172 


The transition map $21 : Vi.2 — V21, defined as 2 0 o,'. is holomorphic. 


Proof. Fix a point x in the intersection of the two neighborhoods, and choose some preimages such that x = m(7) = 
W(T2). Fix y such that T2 = y(71) (they’re in the same orbit under 7, so this exists). We'll check holomorphicity 
locally on $,(x) — specifically, if Uy.2 = Uy Ny~*(U2), we'll prove holomorphicity on g21 = ¢; o (U2). 

Define 6; = 6,, and hj = h,, for convenience. First assume that @1(x) = 0. We know that any point q = $1(x’) in 


this neighborhood in question (@1 o 1(U;,2)) is of the form 
qg=*l(r') = (&(7'))™ 


for some T’ € Uy», where WW, = $107. Let ho be the period of the point Ta € U> — this To is the point such that 


wWo(72) maps to 0. Then we can chase some equalities to find that 


(4) = voll") = ((601872)(a"/")) 


The central trick here is that we stick In a 6,161 term to introduce a q. We know that 60°76; * is a fractional linear 
transformation, and we can note that 0 and oo are both fixed. Thus, 600, + is just multiplication by some a: we now 
have a nice form for the transfer map 

$2,1(q) = (aq'/™)®. 


If hy = 1, this is holomorphic. Otherwise, 7; is elliptic — 72 = y(71) is in the same orbit, so it is elliptic with the same 
period. We defined U2 in such a way that there is only one elliptic point, so T2 = T2, so hy = ho. But this is exactly 
what we want: this means that ¢21(q) = fed, which is a holomorphic map as desired. 

We've only proved this for the case where ¢$;(x) = 0, but this also proves it for the case where ¢o(x) = 0 
(the inverse of a bijective, holomorphic map is holomorphic). And now we can add an intermediate function: write 


2,1 as 2,3 0 3,1, where 3 : U3 — Vs sends our point x to 0, and the composition of two holomorphic maps Is 


holomorphic. 


And this tells us that modular curves Y(T) are Riemann surfaces, which is exactly what we wanted to show. 
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Diamond and Shurman 2.3 — Andrew Gu 


Recall that an elliptic point for a congruence subgroup FT is a point in 7 € Hl such that 


Te-={yel:yr=T} 


is larger than {-+/}. We'll also call the image of 7 in the modular curve Y(T) elliptic, and we'll say that the period of 


such an elliptic curve is |[-|/2 when —/ € [/7 and |f-| otherwise. 
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Last time, we skipped the proof that I; is a finite cyclic group, and that’s going to be the main purpose of this 
lecture. It turns out that most of the proof is showing that this works for Y(1), so we'll start by looking at the elliptic 
points there. 


Recall that the fundamental domain of the upper half-plane is defined as 


1 
D={rew:|Re(r)|< 51121}. 


[fa ob i 


We saw that the map 7 : D — Y(1) is surjective in one of the first lectures: basically we translate with T and then 


and the generators of SLo(Z) are 


use S to get the imaginary part large enough. However, this map is not injective: we know that if m(71) = 1(7), 


then either we're working with the boundary points on the left and right or with the unit circle. The idea is that for 
a b 

Y= ; 
c d 


When c = 0, we know that y = + 


T2 = YT, => [cm + 4| <1. 


b 
/ , which means To = T; + b. On the other hand, when c = +1, 


(Re(7,) +d)? < 1—Im()? < - 


which means |d| < 1 and we can again characterize the elliptic points. This gives us the result: 


Proposition 173 
The elliptic points of SL>(Z) are SL2(Z)i and SL2(Z)w, where w = e27//3. Thus, the isotropy groups for elliptic 


curves are conjugates of (S) and (ST), which have order 4 and 6 respectively. Therefore, all elements of finite 
order are conjugate to one of —/, S*!, (ST)*1, (STST)*1, which have order 2, 4, 6, 3. 


Note that T = / and T = w both generate lattices Z + ZT — one Is square and one is hexagonal. Both of these 
have rotational automorphisms around the origin, which tells us more about the complex elliptic curves associated to 
the lattices. But we don’t have too much to say about this right now. 


With this, we can think about the elliptic points of Y(T) in general: 


Proposition 174 


Let [ be a congruence subgruop. Then Y (I) always has finitely many elliptic points, each with finite cyclic isotropy 


group and period 2 or 3. 


Proof. Pick any tT € H. The isotropy group [, is a subgroup of SL2(Z),. Since SL2(Z), is always cyclic from the 


above argument, so is [,. We can think about what possible groups the [, can be: it can be a subgroup of (S), and 


in this case it can’t be a proper subgroup because we're excluding {+/}, so we must be conjugate to (S) itself. Thus 


the group has order 4, meaning that the elliptic point has order 4 = 2. Similarly, if SL2(Z),; has order 6, then the 
elliptic point has order 3. 
To show that there are finitely many elliptic points, note that [ has finite index in SL2(Z), so we can write it as 


a union of cosets [y;. We know that the elliptic points for [ are always images of / or w, so they are a subset of 
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SLo(Z)iU SLo(Z)w. Taking the images in Y(y) means that our elliptic points are a subset of 


E, = {lyi, Pyjw il <j < dh, 


which is at most 2d elliptic points, as desired. 


In general, we don’t have exactly 2d elliptic points: recall that we worked with F(N),11(V), Fo(N) in previous 


lectures. 


Proposition 175 


T(N) for N > 1, [i(N) for N > 3, and Po(N) for N divisible by a prime p = —1 mod 12 all have no elliptic points. 


We know that a nontrivial isotropy group must be generated by an element of order 3,4,6, so they must have 


characteristic polynomial x? + 1 or x*+x-+1. The idea of the proof is to show that there are no generators of this 


form in the subgroup. 


Proof. For [(N), note that it is a normal subgroup of SL2(Z), so conjugates of its elements are also in [(N). And 
notice that S, 7, ST, (ST)? are all not in F(N), so there are no elements of order 3, 4, 6. 
For [1(N), notice that the trace of the matrix is 2 mod N, and the trace of matrices of order 3, 4, 6 are all —1, 0, 1. 


So whenever N > 3, none of those numbers are 2 mod N. 


d 
with c = 0 mod p, and we know that the trace is a+ d € {—1,0,1} mod p, while dety = ad = 1 mod p. Having 


a b 
Finally, for [o(N), suppose that there is a prime p = —1 mod 12 that divides N. We have a matrix y = | 
Cc 


integer solutions to these equations means that t®? — 4ad must be a perfect square in Z/pZ. But whenever p is 


_ G)-(@)- 


and similarly when p is —1 mod 3, —3 is not a quadratic residue. Thus we get the result that we want. 


1 
For Fy, it turns out that N = 2,3 both have elliptic curves: [4(2) contains the element ; } and we can 


T= 1 


1 -1 
solve the equation T = 5-—; to get the elliptic point $+ 5 with period 2. Similarly, [1(3) contains ; | and 


—2 
s + v3j is our elliptic point in this case. 
We can use this to classify elliptic points of [o(N). Looking ahead, €2 and €3 count the number of elliptic points 


of order 2 and 3, and they show up in the dimension formulas for modular functions. 


Proposition 176 
The period 2 elliptic points of [o(V) are in bijective correspondence with ideals J of Z[/] such that Z[/]/J = Z/nZ. 


Similarly, the period 3 elliptic points of Fg(N) are in bijective correspondence with ideals J of Z[e2”//°] such that 
Te I= a, 


Basically, we consider elements of order 3,4,6 and look at quadratic residues again. This yields the following 


formulas: 
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Proposition 177 
The number of elliptic points of order 2 in Fo(N) is TT, 11y (1 + (+)) when N ¥ 0 mod 4, and 0 otherwise. 


Similarly, the number of elliptic points of order 3 in [9(N) is [ow (1+ (=)) when N ¥ 0 mod 9, and 0 


otherwise. Here, we're using the Legendre symbol except with the convention that (5+) = (=3) = 0 - this is 


only because p = 2 is weird when we're solving the equation n? + 1=0 mod N. 


Diamond and Shurman 2.4-5 — Nikhil Reddy 


We'll talk about cusps in this lecture. Recall from previous lectures that a congruence subgroup I (N) provides a left 
action on the upper half-plane, and then we define the modular curve Y(T) to be the [-equivalence classes on HI. Last 
time, we showed that Y(T) is a Riemann surface, and we'll be extending that result here. 

If we look at the fundamental domain D (which is the points with real part between —$ and 5, outside of the unit 
circle) on the Riemann sphere, the region looks like a triangle (circles and lines are basically equivalent), except that 
we're missing the point at infinity. So we'll introduce that now: when we adjoin it, we also need to adjoin its orbit in 
SL5(Z). We can show that the orbit is rational, but we need to first show that we can get frmo oo to any rational 


point: 


Lemma 178 


a b 
For any r = } € Q, there exists y = ] € SL»2(Z) such that ¥ is sent to r. 
G 


To prove this, we just find c,d € Z such that ad — bc = 1 by the Chinese Remainder Theorem. 


Definition 179 
Define H* to be HU QU {oo}. The compactified modular curve X(T) is 


X(P) = P\H* = Y(T) UP\(QU {oo} ). 


We use the notation Xo9(N), X1(N), X(N) for the special congruence subgroups. A cusp is a I-equivalence class 


of F\(QU {oo}). 


Proposition 180 


X(T) always has finitely many cusps. In particular, X(1) has only one cusp as we showed above. 


Proof. The index of F is finite in SL2(Z), and now take all of the left coset representatives and multiply cusps by 


them to generate other [ equivalence classes. This reaches every rational point, so we must have reached all of the 


equivalence classes, and thus there are finitely many cusps in total. 


We use the Euclidean topology for H, but we need to do a bit more for H*. 


Definition 181 


A neighborhood around oo Is of the form 


Nu = {t € H: Im(r) > M}. 
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The idea here is that circles centered at oo look like lines. Then we can define neighborhoods My U {oo}, and we 
can also get neighborhoods around rationals by defining a(Mjy U {co}) to be a neighborhood around a(oo) for any 
a € SL5(Z). Since we've defined our neighborhoods this way, ‘y is always a homeomorphism, and we get the quotient 
topology from the natural projection map m : H* + X(T). 

Basically, the neighborhoods around rational points are open disks tangent at the rational point, plus the rational 


point itself. 


Theorem 182 


X(T) is a compact Riemann surface. 


Recall that a Riemann surface needs to be connected, Hausdorff, and it needs a series of charts (sometimes called 
an atlas), which are pairs (Um, Vin) such that U,, is a neighborhood around m and V,, is some open set in C (often the 


open disk) such that we have a homeomorphism $m : Um — Vin. In addition, if U,, and U, intersect, the induced map 
Pm(Um O Un) + bn(Um 1 Un) 
should be holomorphic as a function from C to C. And we just need to add being compact to this list. 


Proposition 183 


X(T), the compactified modular group, is connected, compact, and Hausdorff. 


Proof. First, we show that HI* is connected and compact, and then we can project down with the projection map. We 
showed previously that HI is connected, so if HI* is a union of two disjoint open sets, then one must contain H and the 
other is contained in QU {oo}. But there aren't any nontrivial open sets of QU {oo}. 

For compactness, we can first show that the fundamental domain plus the point at infinity DU {oo} is compact in 


H*. Then by definition, we know that translates of D* cover HI*, so 
H* = SL2(Z)(D*), 
and then we can break up SL2(Z) into the right cosets of [: 


= Urata’). 


Apply a to each term to find that 


x(F) =U a(ry(D*)). 


j 
But a and y, are both continuous maps, so this means X(I) is a finite union of compact sets, which means it is 
compact. (We have a finite union because [ has finite order in SLo(Z).) 

Being Hausdorff is a bit more complicated: recall that the idea is that two points pi, p2 must have neighborhoods 
Uz, Uz such that Uy; U2 = @. We now only need to consider the cases where the two points aren't both in HI (because 
we did that in the previous class). 

In the first case, when s; € QU {co} and 7) € H, take some neighborhood U> of T2. We need to show that there 


is a neighborhood of s; that does not intersect yU2 for any y € T, but note that 


Im(a(T)) < max {}. 
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Thus we get an absolute upper bound on the imaginary part of (7), which means that we just need to pick a large 
enough M such that Nyy U {co} does not intersect SL2(Z)U>. And now if a1(oo) = 51, we have 


SLo(Z)U2 Nar(My U {oo}) = ©, 


and we've found our two neighborhoods. 
In the other case where we have two points 51,52 € QU {co}, we'll pick two neighborhoods (such that a, takes 


oo to Ss; and a» takes oo to 59) 
Uy = ai(No U {oo}), Us = a2(No U {oo}). 


In No, we're high enough in the imaginary axis that the only way we can have equivalence is a translation. If the 
neighborhoods intersect, then we know that yai(T1) = Q@o(T2) for some y € T (the two points are equal up to a 
y-action). This means that a5*yaz is a translation, which means that y(s,) = So because oo is fixed. But that 


means s; = 52. Thus the neighborhoods don't intersect for distinct points, and now we've indeed showed that X(T) 


is Hausdorff. 


From here, we have to deal with our charts: last week, we discussed this for any 7 € HI. Basically, we start with 
some open set U whose center Is an elliptic point — because things look equivalent around such elliptic points, we'll 


have regions that are [-equivalent, which means its map to a unit disk is not so nice. So we had to use a straightening 
1 
map 6 = 
1 


conjugate goes to co), and then we see that U becomes a sector of a circle. We then make that into a full circle is 


- last time, which sends our neighborhood to a neighborhood of 0 (because T goes to O and the 
to use p, which raises 6(U) to some power. (This is the identity except at elliptic points, which have hy greater than 
1.) Notably, we make the neighborhoods small enough that the U; avoid other elliptic points. 

We need to figure out how to define our charts for H as well. We'll do the same strategy: starting with a 
straightening map, and then identifying things that are [-equivalent. If we look at this picture from oo, we can 
consider the neighborhoods N> U {oo}. Under F = X(1), this neighborhood is periodic every 1, and that is the only 


equivalence. In general, this is periodic every h, and thus we just need to use the map 


0(T) = e2tit/h 

This is sort of like the g-series map earlier, where we sent our map into a Fourier series on the unit disk. So if we take 
an s € QU {ox}, we will send our point to oo first, and then we'll use the identification map p to fill out the whole 
disk. (We might use different h,'s for different points. ) 


Formally, we're defining 


hs = |SL2(Z)o0/(S{+1} TO" )ool. 


1 oA 
Basically, we're extracting out the fact that (676~+) is generated by ; i. It should be clear that hg is finite — 


otherwise, we get an infinite index of SLo(Z). 

Defining w to be the composition @o 7, we know that w is not necessarily a bijection. However, we can show that 
am(U) (which is our modular curve) is in bijection with V, because only the [-equivalent points are sent to the same 
point. mw and w are both open continuous maps, so 7m is open and continuous as well, which shows that @ Is indeed a 
homeomorphism. 

To show holomorphicity, the idea is that “maps look more or less holomorphic.” When we have one point in HI and 


one in QU {oo}, consider the map from V to U;, which sends g to 6. Ge: After applying this map, we go from U, 
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to U> by applying some ¥, and then we go from U> to V» by applying the exponential map e27/2*/'_ Everything here 
is holomorphic except when we're taking the Ayth roots. But our domain never contains 0, because that means that 
U,M Us contains an elliptic point, and then (U2) contains the elliptic point (7) (and we defined our neighborhoods 
so that the only elliptic points are possibly at the center), so we can always define q'/™ in a consistent way. 

For the case where we have two cusps, we know that xy; * is a translation if our neighborhoods intersect, and 
then we just get another chain of holomorphic maps — this case isn't too bad. 

And now we're done — we've showed all of the required conditions for X being a compact Riemann surface. 


We can now state a first version of the Modularity Theorem: 


Theorem 184 


Let E be a complex elliptic curve with rational j-invariant. Then there is a positive integer N such that there is a 


surjective holomorphic function X(N) — E, known as a modular parameterization. 


15 April 9, 2020 


Diamond and Shurman 3.1 — Christian Altamirano 


We'll talk today about the genus of compact Riemann surfaces. There is a rigorous definition for genus, but intuitively, 
it is just the number of “holes” in our surface (so 0 for a sphere and 1 for a torus). Our goal will be to compute this 
number for a few Riemann surfaces. 

Recall that the modular curve X(T) is defined to be the set of orbits {[7 : 7 € H* = HUQU {oo}}. We showed 
earlier that X(T) is a Riemann surface, and whenever we have a nonconstant holomorphic map f : X — Y between 
compact Riemann surfaces, f is surjective, and f~1(y) is discrete (so finite) for any y € Y. 

It turns out that there is a well-defined degree d € Z* for our function f, so that |f~!(y)| = d for all but finitely 


many points y € Y. We'll be proving something a bit more general: 


Definition 185 


For any x € X, let e, € Z* be the ramification degree of f at x. In other words, ey is the multiplicity with which 


f takes 0 to 0 as a local map, meaning g(x) = x®h(x) and h(x) 4 0. 


Recall the following theorem from complex analysis: 


Theorem 186 (Local Mapping Theorem) 
Let f be a holomorphic function, and suppose that f(z) — wo has a zero of order n (that is, with ramification 


degree e, = n). Then points w near wo will have n distinct roots for the solution f(z) = w near z. 


Lemma 187 
There exists d such that 


for any yEY. 


#2 


Proof. Let € be the set of exceptional points — that is, the points with ramification degree more than 1. This set is 
finite, because all such points are roots of f’, which has finitely many zeros. Therefore, X‘= X\€ and Y’ =Y \ f(E) 
are both still connected. Now fix y € Y’; we know that for each x € f-tiy), we have a neighborhood U, such that f 
is locally bijective on Uy (because x has ramification degree 1); shrink these neighborhoods so that they are disjoint 
from each other, and such that they map to the same neighobrhood V € Y’ containing y. (This is okay because there 
are finitely many images in the preimage.) Now we can define a function y — |f~+(y)| on V; this is a continuous 
function and it is integer-valued, so it must be constant. 


Let that constant be d: we now know that >> e, = d for all points y € Y’ (everything except the exceptional 


xef-(y) 
points). To extend this to Y, note that whenever y = f(x) and x is an exceptional point, we can find a neighborhood 


N(y) of y such that every point in that neighborhood has ee) e€, = d. (Basically, we're replacing e, points of 


multiplicity 1 with 1 point of multiplicity e,). Thus we have the result that we want. 


Definition 188 
Define the degree of f : X — Y be the unique d € Z* such that 


for ally EY. 


Theorem 189 (Riemann-Hurwitz) 


Let gx, gy be the genera of two compact Riemann surfaces X and Y. Then 


29x —2 = d(2gy —2)+ )(e& — 1). 
xXEX 


Proof sketch. The idea Is to triangulate Y — we get Ey edges and Fy faces, and we know that 2—2g = F—E+V for 
any surface. Then lifting under f~+ yields a triangulation of X with dEy edges and dFy faces, but we lose > (ex — 1) 


vertices due to ramification. 


So now we can return to modular curves: suppose that [; C [2 are congruence subgroups of S/2(Z). There 
is a natural projection f : X(T1) + X(P2), sending the orbits [37 — ToT. We can think of this as a nonconstant 


holomorphic map between Riemann surfaces, and we can therefore calculate the degree: 


Proposition 190 
We have 
[Fo :04]/2 -l€fo,-l¢€ly 


degC i sella tee) | — 


[Fo : 14] otherwise. 


Proof. Partition {+/}[2 into a coset partition U{+/}liyj. Pick a point [27 (an orbit in X(P2)) such that this is not 
the image of a point that ramifies; we'll show that f~'([2T) = {liyjT}. (There are only finitely many points that 
ramify. ) 

We verify both inclusions: [yy;7 € f~!([2T), because we'll take f([19;7) to ToyjT = Pot, and for any [y7' € 
f-+*(PoT), we know that f([y7') = Po(t’) = Tot, and then we can find y such that 7’ = yT, meaning that 


Tyr! = Tyr. 
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There is no ramification here, so e, = 1 for every point in f-l(Par), meaning that deg f is the number of cosets. 


Here, recall that we multiply by {+/} in the congruence subgroups, because —/ fixes any point and the action of 
SL»2(Z) factors through +/. 


Earlier on in the class, we discussed the local structure on Riemann surfaces with the straightening maps 6 and 


wrapping maps p1(z) = z™, p2(z) = z'. For a subset U of H, define p; 0 6(U) = Vy and p20 6(U) = Vy. If we 


denote [;, to be the isotropy subgroup of 7 in j = 1,2, and we proved last time that hy = |{+/}I;7|/2 € {1, 2, 3} 
(because elliptic points have order either 2 or 3). Since z should be integral, because the local map from V, to V5 Is 


the holomorphic map g'2/", there are very few cases: we must either have hy = 1 or hy = ho. 


Proposition 191 


The ramification degree for 7 € H Is ho when 7 Is an elliptic point for [> but not for [, and 1 otherwise. This is 


also the size of the quotient subgroup [{+/}l 27 : {+/}04,7]. 


Similarly, we can define a ramification degree for cusps. Earlier, we showed that if U is a neighborhood of s € 
QU {oo}, our maps p1(Z), p2(z) are e272/M and e27/2/' respectively, so this time the local map looks like g > q’/', 
where hy = [SLo(Z)oo : {+/}1;,5] are the widths. 


Proposition 192 


The ramification degree for an s € QU {co} is zB. 


{elt [eee 


This is also the size of the quotient subgroup [{+/}I2. : 


Proposition 193 
Let [; be normal in Tz. Then all points in X([1) with the same image in X([2) have the same ramification 


degree. 


Proof. Suppose [471 and [47> are points in X(T 1) that map to the same point in X([2). We know that [271 = [2To, 


so we can find yy such that y7, = T2. And the idea from here is that conjugation by y does not change the period. 


This now allows us to compute the genus: recall that for a group action G on X, Gx Is the orbit of x € X. We'll 
consider the case [y =T and [2 = SL2(Z). 


Definition 194 (Local definitions) 
Let yo = SLo(Z)i, y3 = SLo(Z)e2"/3, and ys, = SL2(Z)oo. These are an elliptic point of period 2, elliptic point 


of period 3, and cusp of X(1), respectively. Let €2, €3 be the number of elliptic points of [ in f-!(y2) and f—+(yz), 


and let €,, be the number of cusps of X(T). 


But any elliptic point [7 of period 2 has an image that is also an elliptic point of period 2. (If h, > 1, then 
hy = hy > 1). The only elliptic point of period 2 in X is yo, so we indeed have [Tt = yo. The same argument works 
for €3, SO Eg and €3 account for all of the elliptic points. 

So now for h = 2,3, we know that the ramification degree of the points in f~!(yp) is A for all elliptic points and 1 


otherwise. Thus, the degree of f is 


d= » ey = h+(|f-"(yn)| — En) + 1° En. 
xe f-l(yn) 
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Similarly, we can find €,, by noting that 
d= S- e& => S- (Qc— 1) Hd — ee; 
xXEF~ (Yoo) XE fF (Yoo) 


and then we can apply Riemann-Hurwitz: 


Theorem 195 
The genus of X(T) is 


g=14 


where d is the degree of the natural projection f : X(T) > X(1). 


(Here, we've used the fact that the fundamental domain D has genus 0 — it’s basically a triangle.) 


Theorem 196 
Let p be a prime, and let k = p+1. The genus of the congruence subgroup Xo(p) = X(To(p)) is 


[| -—1 k=2mod 12 
|| otherwise. 


k will show up later as the weight of some modular forms — this should remind us of computing the dimension of 


cusp forms of some weight k. 


. The point of these ajs 


1 0 1 -1 
Proof. We'll need a few results: let aj = ; 1 for j € [0,p — 1], and let ay = ; ‘ 
J 
is that we can split SL2(Z) into a disjoint union Uj; Fo(p)ay. 


We'll skip the proofs of a few computational results: 


Lemma 197 


Xo(p) has exactly two cusps at 0 and oo. 


Lemma 198 
For any 0 <j < p, we have ya;(i) = a;(/) for some y € ['9(p) of order 4 if and only if 2? +1 =0 mod p. 


The purpose of these results is that the number of elliptic points of period 2 is the number of solutions to 
x? + 1=0 mod p. This is because we have a nontrivial martix -y that fixes aj(/), so a;(/) is an elliptic point, and 
because It is in the orbit of /, it must have period 2. And we know how to compute this number mod 4 using some 


results in algebra (it’s 2 for p = 1 mod 4, 0 for p=3 mod 4, and 1 for p = 2). 


Lemma 199 


For any 0 <j < p, wehave ya;(e2"//3) = aj(e27//3) for some y € Fo(p) of order 6 if and only if /2—j+1 = 0 mod p. 


The logic is the same here, and we also know how to compute this number mod 3. Putting all of this together 


means that we can do casework on the residue of p mod 12, to give the desired result. 
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Example 200 


Setting p = 13, this is the smallest prime where all four possible elliptic points occur. 


We can also consider some coset representatives of SL5(Z)/T9(13): 


1 0 1 -1 QO -1 
Ed ab dt a 


and this partitions the fundamental domain D of Xo(13) into 14 regions. There are 13 points in SL2(Z)/, and we can 


0 1 
1 0 


actually associate these with the 6;(/): it turns out that jj’ + 1 = 0 mod 13 if and only if y6;(/) = By (1) for some + 
of degree 4. 

We also yield some facts about elliptic points: for example, because 5-5+1=0 mod 13, this gives So this allows 
us to Identify points of the boundary arc with each other, as well as understand the orientation of the fundamental 


domain. 


Modular forms and representations of real groups — Professor Kim 


We'll talk about how modular forms induce representations for real groups (like SL,(IR), SO,(IR), and so on). 


Definition 201 
Let G be a group (in general, we're interesting in locally compact groups such as S/2(IR)). A representation of 


G on a vector space V,, is a continuous homomorphism a: G > Aut(V,). 


Definition 202 


Let G act on a topological space X, and let Vx = C2(x) (the space of compactly supported smooth functions). 


Define the right regular representation R, via the action 


(Rx(9)F)(x) = Fxg) 


when X =G. 


One of the questions to study is the space of L? (square-integrable) functions L?(G) or L?([\G) for some discrete 
subgroup. The point is that modular forms (as well as Maass forms) generate representations of S/>(R), which 
induces an automorphic representation. 

To understand what's going on, let’s fix a group G = SL2(R) and let K = SO(2), T = SL2(Z). We know that 
G/K is homeomorphic to H, because K can be characterized as the stabilizer of / in G, so we send g to gi. Now 


recall that a modular form is a holomorphic function with the automorphic condition 
F(g,z) =J(g,z)*F(z), 


where j = cz +d. This identity can be notationally rewritten as (f|,g)(z) = f(z) — this is the [y], operator from 


earlier in the class — and now we can define a function 


br(g) = Fl«(9)(i) =5(9,2) *F(gi). 
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This function ¢¢ inherits some properties from f: it is smooth (because f is nice), it defines a function on the quotient 


cos@ —sin@ 


space C®(p\SLo(R), and the right action of the rotation matrix rg = € K pops out as a “character” 


sin@  cos@ 
ink: 
br (gre) = er (g). 


This means that ¢- is a one-dimensional representation of K. 

Also, since f is holomorphic at oo, there is moderate growth of this function ¢¢ at joo. (In addition, if Ff is 
cuspoidal, then @¢ is rapidly decreasing.) Finally, because f is holomorphic, oe = 0, which means there exists a 
differential operator F such that F- dr = 0. 


In general, the above properties define automorphic forms of SL2(IR), and more generally any real reductive group. 


Definition 203 


Let G be a connected real reductive group. and let [ be an arithmetic subgroup of G. A function @ on [\G is an 


automorphic form if ¢ is smooth, of moderate growth, is K-finite (the vector space K@ is finite-dimensional), 


plus another property related to Maass forms and the Lie algebra. 


We can let A(G,I) be the space of automorphic forms on G with respect to [. This is G-stable with respect to 
the right-regular representation, and we can use this to study L?(G) because the cuspoidal automorphic forms are 
contained in L?(G): Ao(G,F) c L?(G). If we go back to the SL2(Z) example, take a function f € M,(F), and 
consider 


Vo, = (R(g)brlg € G). 


This gives us a holomorphic discrete series 7, of minimal weight k, and we can make the connection here: 


Mk(T) = Homeg, «(4K A(G,T)). 
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Diamond and Shurman 3.2 — Zack Chroman 
We'll be talking today about automorphic forms. Recall from earlier in the class the weight-k operator 
(flok)(7) =s(a, 7) *F(a(7)), 


where j(a,T) = cT+d anda € SL>(Z). In Serre, we defined a modular function in order to define a modular form, 


and we went from that to restricting to specific congruence subgroups. We'll work backwards now: 


Definition 204 


An automorphic form is a function f : H — C that is meromorphic, weight-k invariant under the congruence 


subgroup [, and meromorphic at infinity for all a € SL2(Z). Denote the space of automorphic forms at weight k 


as A, (T). 


When we look at the case of k = 0. Ao(l) = C(X(I)) corresponds to the space of meromorphic functions on 


X(l). This is because our functions need to be invariant under [, so we can define on a reduced domain X(T). 
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Theorem 205 


We have that Ag(SLo(Z) = C(y), where j = ae 


Here, C(j/) is the space of rational functions in /. 


Proof. Clearly C(j) is contained in Ag(SL2(Z), because / is weight zero. To go backwards, we'll need to do more 


work. We know that the Laurent series is of the form 
; a ee 
ir) ==+ 0 ang”, 
q n=0 


so the only pole of / is at infinity, because that’s the only point where A = 0. So j is holomorphic on the space Yj; 
(which is X1 without the point at infinity), which means j/ : Y(1) = C is a homeomorphism. In general, if we have a 


meromorphic function f which has zeros z; and poles p;, we can define 


Tis Gi(7) — i) 
TI? U(r) — i(p)) 


This function has the same zeros and poles as f, so f has no poles and zeros at any finite point. And the number of 


g(t) = 


poles and zeros need to add up to the same number on this compact Riemann surface, including multiplicity, so the 


ratio must have no zeros or poles even at infinity, which means that f = cg for some constant c. Since g is a rational 


function of 7, this means f is also a rational function of T. 


Later on, we'll be able to compute Ao(T) for other congruence subgroups, too. 

Our next goal is to extend this result — when k 4 0, we can’t make our automorphic forms into a map on X(T), 
because even if we have two points that are equivalent under T, they'll have some conjugate value under the action. 
But we should be able to define the order of vanishing consistently, and what’s morally going on is that two points 
under the action of y € T will keep the order the same because the factor of automorphy cz+d has no zeros or poles 
on H. 


Definition 206 


If m(7) (the projection from C to X(T)) is not a cusp, define the order of vanishing at 7(7) as 


v,(F) 


Var) (F) = A’ 


where h = |4 t || is the period of the elliptic point 7. 


The idea is that the coordinate charts are no longer injective — they're h-to-one. So this order of vanishing is 
sometimes not an integer, if we're looking at elliptic points where we have half- or third-integers. 
But the case where m(rT) is a cusp, the order of f at 7 is more complicated. We'll consider 7 = oo first: recall 


2Qnit /h 


that we needed the local coordinate g = e , and then we wrote f as a power series in g, defining the order to 


1 oA 
be the lowest degree term. Here, A is the width: it’s the minimal A such that one of + F ( are in T. 
If f is h-periodic, we can write down a power series in g, and everything works out. But it’s possible that — i 


al 


1 oA 
isin T, but E ( is not. Then f(t + h) = —f(7T), meaning our function is 2h-periodic instead of h-periodic. 
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So we'll define h’ = 2h, so that we're writing our power series as 


f(T) = S- andy 
n=m 


and then we'll define the order at infinity v.,(f) = A 


And to get the complete definition at cusps, we just need to conjugate from oo to s: 


Definition 207 


Let s be a cusp. Then the order of vanishing of 1(s) is 


Us(f)/2  k odd, (a Ta) = (- 


Urtayit } = al 


v;(f) otherwise, 


where a is the element of SL2(Z) such that aco = s. 


In almost all cases, f will be h-periodic, and we won't have the extra case. We'll call the extra case an irregular 
cusp, and we'll see later on that the only case this happens for our familiar congruence subgroups is s = $ for [1 (4). 


From here, we’re done with the definitions, and we now have a nice result to characterize the space of cusp forms: 


Theorem 208 
Let k, N € N such that k(N +1) = 24, and let S, be the space of all cusp forms of weight k. Define ¢,(T) = 
nr) n(N7)*, such that (7) = e°"/-* [1 = @"). 


© If Sk(Pi(N)) is nonzero, it is equal to Cx. 


+ In addition, if S,(Fo) is nonzero, then S,(To(N)) = Sx(Tr(N)) = Cob. 


Here, N is either 1 or a prime number, so the second point follows from the first because S,(To(N)) C Sx(Fi(N)). 
An important case is N = 1, k = 12: this tells us that all cusp forms of weight 12 over SL2(Z) are multiples of A. 


Proof. We can first define 
9g = Oh = (20) A(T) A(247), 


and some more substitution yields 


- qht1 [[a _ yea _ ge. 
n=1 


Now A(Nr) is a cusp form on Sy2(Fo(N)), so substitution tells us that g € Soa(To(N)) C Soa(Ti(N)). 


Lemma 209 


Our function g has a zero of order N + 1 at every cusp m(s) (here, m is the projection map onto X(T1(N))). 


Proof of lemma. We'll need to blackbox a few results: it turns out that all cusps are regular for N 4 4. Also, the cusps 
of X(To(N)) are mo(oo) and 79(0), and the widths here are 1 and N respectively. Finally, we'll need the projection 
map X(T1(N)) > X(To(N)) to be unramified at the cusps. 

The second assumption is the most important here, because it gives us a handle on the cusps on the larger space 
X(Pi(N)). For any cusp s € X(Ti(N)), we know that m(s) lies over either 1(00) or 1(0). 
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In the first case, where m(s) is a Cusp over (oo), we can write s = aoo for some matrix a € [o(N). Since g is 


T4(N)-invariant, it is invariant under the weight-24 action of a, which means that 


Vrs) (9) = v;(g) == Voo(g) =N+1, 


where the last equality comes from the direct expansion of g. 


0 -1 
In the second case, m(s) is a cusp over 79(0), so the weight-24 operator of S = F P | on g looks like 


co 


g[S]oa = N~*an** | (1 — ah)**1 = a), 
1 


n= 


where qy = €27'/N. Since S sends oo to zero, the order of (S) at g is the same as the order of 1(0) at g, which is 
also N+ 1. 


One important point is that we needed to write things in terms of qy instead of our usual g, because the width of 
To(0) is N. So now we can finish the proof: if f € S,(Fi(N)), we can consider the automorphic form —_ This is 
holomorphic on Y(f4(NV)), because g has no zeros (it’s the product of A functions), and because of our lemma, this 
is also holomorphic on the cusps. This is because the numerator f+! has a zero at every cusp, so the order is at 
least as large in the numerator as the denominator. But then this is a holomorphic function defined on a compact set 
X(T1(N)), so we actually have that oe = c for a constant c, which implies that f = c’bx € Cox (dx is one of the 
(N + 1)th roots of g, and we need to pick the same (N + 1)th root by continuity throughout). 


We haven't shown that @, is actually a cusp form yet — we do know this is true for (N, k) = (12,1), because then 
Sx(SLo(Z)) = C(A). On the other hand, whenever k is odd, S,(To(N)) = 0, because —/ is in [o(N). But it does 


turn out that (using the dimension formulas that we'll go over in the next few classes) we do have 
Sk(To(N)) = Sk(Pa(N)) = Cox : 


we have a space of cusp forms of dimension 1. 


Diamond and Shurman 3.3 — Michael Tang 


The topic of this section is meromorphic differentials — these will be helpful in talking about modular forms for the 
rest of the chapter. Today, we'll go over some motivation, define local and then global differentials, and then we'll 
go from differentials to automorphic forms. On Thursday, we'll finish the lecture by going in the opposite direction, 
giving us an isomorphism between automorphic forms and differentials, and we can use this to talk about dimension 
of modular forms. 


We'll start with the transformation rule: for any function f € Azp(T), we have 


F(y(T)) =J(yT)"F(7) Wr eT. 


dy(t) _ 


We can notice that —,= 


J(y,7)~? by computing with the quotient rule, so we can now say that 


Fo) = (2) ren, 


and now we can symbolically treat the derivative as a fraction and rewrite as 


i = lay) = Tran) 
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This doesn’t have any real meaning yet, but we'll make it more formal later. But this is nice, because f(z)(dz)” is 
invariant under the action of [ — we'll make the definitions make sense now, so that we can exploit the useful properties 
here. 


We'll start by looking at open sets on C: 


Definition 210 


Let V CC be an open set, and let n € N. The meromorphic differentials on V of degree n are 


Q°"(V) = {f(q)(dq)" : f meromorphic on V}. 


We can set up a vector space structure over C, because f and g being meromorphic means fg,cf are both 
meromorphic as well. The dq here is just a symbol — everything is really just determined by f and n. 

In order to study this on Riemann surfaces, we'll need to talk about a mapping between local differentials: if we 
have two open sets V1, V2 and a holomorphic map @ between them, we'll induce a pullback map ¢* in the other 


direction between the differentials, defined by 


b* (F(q2)(dq2)") = [F(b(a1))(¢'(1))"IMda1)”. 


This is basically a u-substitution. We can confirm that whenever ¢ is holomorphic and f is meromorphic, (f o ¢)(¢’)” 


is also meromorphic. 


Example 211 


If we take the inclusion 4: V, C Vo, then v* Is just a restriction: 


U*(F(q)(dq)") = [F(eCq)) 1" (da)" = Flu (a)(da)’. 


We can note a few properties of this pullback map: 

- @ isa linear map from Q2®"(Vs) to Q°"(VYY). 

+ @ is a contravariant operator: (¢2 0 ¢1)* = Gj o G5. 

Both of these results are just bookkeeping of our earlier definitions. We'll also state a few properties without proof: 

- If @ is surjective, then ¢* is injective. Similarly, if @ is a bijection, so is @*, and we have the identity (¢*)~-1 = 
(p*)*. 

We now want to define differentials on the entire Riemann surface X. Recall that when we defined the Riemann 


surface, we used a collection of coordinate charts (¢,)jey, each of which maps neighborhoods U; of X (here the U; 


must cover X) to open sets in C, and one condition is that they must be compatible: 
Pj = Px © oj * 


must be a holomorphic map from V; + V, for all j,k € J. So similarly, a global differential should be a collection of 


local differentials which are compatible. 
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Definition 212 
Let X be a Riemann surface with charts ¢; : U; — V;. A meromorphic differential on X of degree n is a 


collection of (w,)jey, where w; € Q°"(Vj), such that for all j,k € J, 


oi. (Wal6,(U;nu.)) = jl by (UjAUk) 


(the pullback of the transition map does send w, to w; whenever both are defined). We denote the set of 
differentials by Q°"(X). 


Example 213 
Consider X = C/A to be a complex torus. Recall that the coordinate charts @; are just the projection map 
mam :C-— X, and the inverses are just defined locally in the simple way. Then the transition maps are then just 


“corrections between fundamental domains:” we have $x, ;(Z) = z+ A for some A € A. 


We can check the compatibility condition — because the inverse should have derivative 1, we can just say that 
w; = dz for all y € J, and then indeed Pj, j (wi) = dz = Wx, and the collection of ws makes sense on X. 

So now we can work towards automorphic forms: let [ be a congruence subgroup of S/2(Z), and we'll spend the 
rest of this lecture on X([). Remember that neighborhoods of X(I) look like m(U;), where 7 is a projection from H* 
to X(T) and U;. Then we defined the coordinate charts indirectly via 


Wj =O OT, 


2ni5(T)/h 


where w; = e is the identification map we've been disussing. So the local differentials are defined on Vj, and 


now we can define a global pullback of w by pulling back the wys individually: 
n*(w)ly = VF (Ww), 


where the primes mean that we're only working on H. We just need to make sure these local pullbacks agree on the 
intersections of the Ujs, in the same way that we needed to make sure the differentials were compatible. In other 
words, we need to show that 

Vi (wily) = Ve = We (Waly) » 
where Vig = Wj(U; U;) and Vij = Pe(U; AN Uy). 


Proof. We know that w; = ¢;0 7 and W, = b, © 7, So solving for 7 yields 
voor" =TSod, 


This means that 
bj = bx O77 = Voz’, 


meaning that we can write Wx = oj 0 Wj. This is helpful, because the compatibility condition for the ws has to do 


with the pullback of the transition map. So taking the pullback of both sides and using contravariance yields 


Vie = Uj Dig > VW) = V7 (O4j(We)) = YF (wy), 


as desired. 
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This shows that 2* is well-defined, because the local pullbacks take the same value whenever they intersect. And 
the Uis cover H — we'll talk on Thursday about extending this to the cusps, too. 
In summary, if we have a meromorphic differential w € Q°"(X(T)) of degree n, we can construct a meromorphic 


differential m*(w) on H,, also of degree n. We can then define the function f such that 
W*(w) = f(T)(dT)". 


Since m*(w) lives in X(T), the pullback doesn’t care about the individual elements of [, so it must be [-invariant: in 
other words, * acts trivially on Q®"(X(T)). 
And then 
F(r)(dr)" =" (F(r)(dr)") = F(y(r))('(7))"(dr)", 


and plugging in the definition of the derivative in our case yields 


‘=f muag 


and now we've gotten that f is weakly modular of weight 2n, which is a (roundabout) way to motivate our definition 
of weakly modular! 

Next time, we'll show that f is actually an automorphic form — we need to show meromorphicity when acting by 
the weight-k operators and at infinity. It turns out that it’s easier to study differential forms than their corresponding 


automorphic forms, so this will help us get some more insight out of the group Aop. 
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Diamond and Shurman 3.3 continued — Michael Tang 


We'll continue discussing meromorphic differentials today — as a recap, recall that we defined the local differentials 
F(q)(dq)" € Q®"(V) on open sets V C C, and then we defined pullback maps from Q°"(V2) > 2®"(VY4) given a 
homomorphism between open sets V; + V2. We combined these local differentials into global differentials on Riemann 
surfaces, making sure that they were compatible through the pullback (transition) maps. Specifically, if we consider 
a congruence subgroup F C SL2(Z), we constructed a global pullback f(7)(d7)” on the modular curve X(T). We 
noticed that this yields a weakly modular function of weight 2n, and it turns out that fla], is actually meromorphic at 
oo, so f is an element of A»o,(F). This is important, because it gives us a map between differentials and automorphic 
forms. 

Today, we'll consider the inverse mapping as well: we'll start with f € Ao,(T), and we'll construct the w that pulls 
back to f. This will give us an isomorphism between the spaces of meromorphic forms and automorphic forms, and 
we'll see why this is important. 


Remember that the mapping w — f was defined via the pullback map 
m (w)lur = Yj (Wj) = F(7)(dr)”. 


Here, U; is an open subset of H, 7 is the projection map, and w takes a neighborhood U; to a neighborhood Vj. To 
show this was valid, we needed to show that the local pullbacks agreed: the compatibility condition we needed to 


satisfy was 


WeWk) = (W7 0 Hy) (We) = Vj (wy). 
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Let's show the converse Is true as well: 


Lemma 214 
Suppose (w;) is a collection of differentials on the Vjs, and let wy : Uj; — V; be identification maps for X(T). If 


W; (wi) = W;(w;,) on the intersections, then the wjs are compatible, meaning they define a global differential. 


Proof. Run the previous argument in reverse: each equality is an if and only if statement. 


This is helpful because we can start with an f, construct the individual w s, make sure they pull back to f(7)(drT)”, 
and that will give us the w. 
For the inverse construction, let's start with an automorphic form f € Az,(T). We are doing things locally, so we 


can start with a neighborhood U; € HI*, we want to construct an w; so that 
Wj (wj) = (F(T) (dT)")|y;- 


We'll want to do a “pushforward,” but we can just try to take an inverse map w~+. Unfortunately, w; isn’t invertible 
in general, so we'll need to look more carefully what’s going on. Recall that when we set up the charts of Riemann 
surfaces, we wrote 


Wj = pj 06 


a ; ; lL —t% ; 
where 6; is a linear transformation (the action of the matrix ul ), while p;(z) = z" for non-cusps and p;(z) = 
Ty 
e27/2/h for cusps. So we can try to do our pushforward in two steps: construct an intermediate differential such that 
pulling back by 6; gives the original differential w;, and then pulling back by p; gives the intermediate differential. 
We'll start with the linear transformation: luckily this is invertible, because the determinant is nonzero whenever 


T, € H, so we just need to define 
dj = a" ((F(7)(47)"lur) 


where a = 5;*. And doing some computation with this, 
dj = at (F(r)(dr)"ly)) = Fa(z))(a"(z))"(dz)", 
and here a’/(z) = det aj(a, z)~*. (We can't assume ad — bc = 1 for the matrix like usual.) This yields 
= f(a(z)) - (det o)"j(a, z)-*"(dz)”, 


and we can still define (by analogy) that this A; = (f[a]on)(z)(dz)" — this is a generalization of the weight-k 
operator. 

So now we need to find an w; such that p7(wj) = Aj. If we take a small neighborhood Uj, we can try to find a 
formula for a local differential form that will pull back, at least in a small region. We'll just do the case where 7; is not 


a cusp, so we have p;(z) = 2”. 


Lemma 215 
h 


The function z"(f[a]on)(z) is only dependent on z”. 


Remember that fh is defined to be the size of the isotropy subgroup {+/} (6,007 *).,/{+}, and we showed that 


2nih 


this is finite cyclic, generated by a rotation r,(z) = e*™'"z. 
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Proof. This follows from the [-invariance of f(7)(dT)”", which means that a*(f(T)(dT)")) ts 6,05; *-invariant: 
(5798;7) = (G1) oy" 067 = aytar, 
and applying this to a* works out the way we want. And this means rp (Aj) = Aj, so 
(e77/"2)"(Fla}on)(e7™/"z) = 2"(Fla}n)(Z), 


which means that z"(f[a]o,)(Z) is invariant under a rotation by an Ath root of unity, meaning it is only dependent on 


ran 


The point is that we can now define 
gj(Z") = 2"(F[a}on)(Z) 


for some meromorphic function g;. It turns out that 


uy = BS ay’ 


now works, because we can check that pulling back by g; yields 
0; (wj) = (Fla}an)(z)(dz)” = rj, 


because the (hq)” makes the derivatives work out. So now we've constructed the local differential w;! 


Fact 216 


For the cusp case, we can find a similar property: (f[a]2,)(z) turns out to only be a function of e27/2/", and 


= ot (dq)". 


So in all cases, we have a local differential w;, such that pulling back gives us our original f(7)(d7)”. Since all the 
wy s pull back to the same f, they are compatible, so we get a global differential on the Riemann surface. 


Combining all of this together, we get the isomorphism we've been after: 


Theorem 217 
The space of automorphic forms is isomorphic to the space of meromorphic differentials: for any n € N and 


congruence subgroup T, 


2°"(X(T)) = Aan(F) 


as complex vector spaces. 


We do have a bijection, and we just need to show that there is a linear map between them, but the pullback map 
is linear and we're not doing anything weird with multiplication or addition in ways that are not compatible. 

This is important, because we want to compute the dimensions of the spaces Mx(I) and Sx() (modular forms 
and cusp forms, respectively). Here, modular forms are automorphic forms that are holomorphic, and cusp forms 
are such forms that vanish at oo. Instead of looking at the dimensions directly, we can look at the images under the 
isomorphism, and compute the dimensions of the images in Q®*/2(X(T)). Riemann surfaces turn out to have more 
structure — we'll be able to write things in terms of the order of vanishing in a purely algebraic way. For example, being 


a modular form means that the order of vanishing of f at every point 1(7;) and 7(s,0 is at least 0. 
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Diamond and Shurman 3.4 — David Wu 


We'll follow up on the previous topic by talking about divisors and the Riemann-Roch formula. Throughout this lecture, 


we'll be talking about general Riemann surfaces X, though we really care about X(T). 


Definition 218 


A divisor D on a compact Riemann surface X is a finite formal Z-linear combination of points of X: 


= Se Nyx, Ny € Z, ny, =0 for all but finitely many x. 
xXxExX 


We can look at the set of all divisors on X, denoted Div(x), and this forms the free abelian group on the points 
of X: basically, we add two divisors by adding the coefficients. We'll define a (partial) ordering on Div(x) as well: say 
that D > D’ if the coefficients satisfy n, > nj, for all x. 


Definition 219 


The degree of a divisor is deg(D) = )°, nx (since we assume that nx is zero for all but finitely many x). 


Notice that the degree map from Div(x) to Z is a homomorphism of abelian groups. A natural question is to ask 
about the kernel of this map — one natural place to start is to look at the function field of meromorphic functions 
C(x). (Remember that being a meromorphic function f on X means that pre- and post-composing it with the 


embedding into C yields a meromorphic function on C.) For any such meromorphic function f, we have the divisor 
div(F) = So uX(F)x, 

where vr Is the order of vanishing of f on x. Then we claim that the map 
div : C(X)* > Div(X) 


is a homomorphism as well, because div(f,f) = div(f,) + div(f2). (Just consider the Laurent expansion at each point: 


the powers should add when we multiply the functions.) 


Proposition 220 


The degree of any of these nonzero divisors f € C(X) is zero. 


Proof. A meromorphic function f : X > C is a holomorphic function on €, where C is C without a finite set of points. 


We know that (from Christian's lecture) the degree 


a= >. a) 


x€f-l(y) 


is constant. But the number of zeros is equal to the number of poles if we take y = 0, co. 


As a more handwavy illustrative example, if we take X to be the Riemann sphere and we draw a closed contour, 


this is saying that we can apply the argument principle on both sides of the contour. 
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Definition 221 


Let Div’(X) be the group of divisors of nonzero meromorphic functions, and let Div?(X) be the group of divisors 


of degree 0 (this is the kernel of the degree map). By the proposition above, Div’(X) is a subgroup of Div?(X). 


There are divisors that don’t come from meromorphic functions, but we have a handy theorem: 


Theorem 222 (Abel) 
Let g be the genus of X, and let Ag be a lattice that spans C9. Then 


Div?(X)/Div’(X) & C9/Ag 


(a g-holed torus). 


For example, if we let A; = (Z @ Z and consider the elliptic curve C/A;, we can compute the divisor of 2 where 


g is the Weierstrass function — we know the zeros and poles of this function, so this is easy to do directly. 


Definition 223 


For any divisor D on X, the linear space of D is 


L(D) = {f € C(X) : f =0 or div(f) + D > OF. 


This might look mysterious in general, but if we have D = div(f) for some meromorphic function f, L(D) consists 


of the functions f that “cancel out the poles,” meaning ff must be holomorphic. We can indeed check that 
Ux(f, + f) 2 min(y,(f.), Xx (fe), 


so L(D) is closed under addition. In fact, L(D) is a finite-dimensional vector space: we'll denote its dimension to be 
£(D). Take an n EN, and let w € 2°"(X) be a nonzero meromorphic differential on X. We have a local map at each 
point in x € X of the form 

Wx = f(q)(dq)”, 


where fy just means we're looking at the local map at the point x, and now if vy(W) = Vo(f) (because q is centered 


around 0), we define 


div(w) = S- Vy(W)X. 
x 
Here, we'll have div(w1w2) = div(w1) + div(we) for any wy € Q°"(X) and we € N®'"(X). 


Definition 224 


A canonical divisor on X is a divisor of the form div(A), where A is a nonzero element of 21(X). 


Theorem 225 (Riemann-Roch) 


Let X be a compact Riemann surface of genus g, and let div(A) be any canonical divisor on X. For any D € Div(X), 


£(D) = deg(D) —g+1+ €(div(A) — D). 


This is important because we have explicit information about canonical divisors. (We won't prove this.) 
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Corollary 226 


Take previous notation. We have the following results: 
1. &(div(A)) = g. 
2. deg(div(A)) = 2g — 2. 


3. For any divisor D with deg(D) < 0, we have @(D) = 0. 


4. For any divisor D of degree larger than 2g — 2, £(D) =deg(D) —g +1. 


Proof. For (1), if f is nonconstant, it has a pole (for example, polynomials have a pole at infinity). This means 
div(f) > 0 cannot occur, which means that L(0) only contains constant functions, meaning £(0) = 1. Taking D =0 
in Riemann-Roch, we find that 1 = —g+1-+ @(div(A)), so €(div(A)) as desired. 

For (2), we can set D = div(A), which yields g = deg(div(A)) — g + 1+ 1. Rearranging gives the desired result. 

For (3), suppose that £(D) > 0, so there is a nontrivial element f € L(D). This means div(f) > —D, and taking 
degrees yields deg(D) > 0, so we've proved the contrapositive. 

Finally, (2) and (3) imply (4): we have é(div(A) — D) = 0, and 


deg(div(A) — D) = deg(div(A)) — degD < 0, 


since the degree of D is larger than 2g — 2. 


Recall that Zack's lecture showed us that a nonzero automorphic form j’ € A(T) exists for any congruence 


subgroup [. Let f be any nonzero automorphic form of weight 2: define 
A= w(f) € Q(X(P)) 


to be the meromorphic differential for f. Then div(A) is a canonical divisor, and by (2) above, the degree must be 


(2g — 2). So if we take any positive even integer k, 
i k/2 = Oia: 0.48) 


(multiplying the meromorphic functions and the dqs), and because degree is a homomorphism, this gives us a degree 


of k(g — 1) for the divisor of d*/2_ We know the equality of vector spaces 
QK/2(X(F)) = C(X(T))AN?, 


and for any meromorphic f, deg(div(f)) = 0. Thus by additivity of the degree, every nonzero differential w € 
Q2/2(X(F)) has a degree of k(g — 1). 

So now if [ is a congruence subgroup and g is the genus of X(I), the space of holomorphic one-forms Q;7.,(X(T)) is 
isomorphic to L(A), the linear space of A. By (1) above, the dimension of the linear space £(A) = g, so the dimension of 
S(T) is g. (We can show that the space of weight-2 cusp forms is isomorphic to the space of holomorphic differentials 
of degree 1.) In general, this argument will let us find general dimension formulas for Mx(T) and S,(T). 

In summary, we've defined these divisors in terms of the order of vanishing of the functions. When we proved the 
formulas for dimension on SL>(Z), we did an argument relating orders of vanishing at /, 0,00, and so on, and we're 
generalizing here: we're taking the Riemann surface X(T) and using order of vanishing data to relate that to the space 


of modular forms. 
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18 April 21, 2020 


Diamond and Shurman 3.5 — Kaarel Haenni 


The plan for today is to discuss dimension formulas for the spaces of modular forms and cusp forms of weight k: in 
this lecture, we'll combine results from the past few lectures to answer this question for even k. 

We'll start with a quick review: recall that an automorphic form is a function f : H > C which is like a modular 
form, but we only need to be meromorphic everywhere. If we take a nonzero f € Ax(I), it may not make sense to 
talk about f as a function on X(T) because it’s not constant on orbits, but we can talk about the order of vanishing 
(which is constant). We define this order via v,(f) = ze where hf is the period of 7. We want to talk about the 


formal sum 


div(f)= So y(F)x, 


xEX(P) 
which looks like the divisor from last lecture, but the small technical difficulty is that these v, may be rational rather 
than just being integers. So we'll just fix this: we'll define the rational-coefficient divisor space Divg(X) to be the 
space of formal sums 5) n,x, where ny, are rational and almost all zero. This still has a natural abelian group, a > 
relation, and a degree function (adding all the coefficients, just like for the integer case). 
Riemann-Roch does not extend directly to this Q case, but it will still be useful. Recall that a corollary of Riemann- 
Roch is that when we have a compact Riemann surface of genus g, the linear space L(D) of a divisor D € Div?(X) 


(which is the space of divisors of degree 0) satisfies 
£(D) = dim(L(D)) = deg(D) —g +1 
whenever deg(D) > 2g — 2. We also know that we have an isomorphism of complex vector spaces 
w: Ak(T) + 2°*/? (X(T), 


such that f is sent to w,; and w, pulls back to f(T)(dT)*/?. With this, we'll move on to new material: our goal is to 
reduce the dimension calculations to finding dimensions of linear spaces. We'll look at k > 2 and even, so that we 
know there exists a nonzero function f € A,(I). Let C(X(I)) be the field of meromorphic functions on X(T): recall 
that 

A(T) = C(X(1))F 


(the automorphic forms are equal to the meromorphic forms times a particular function). Using this description, we 


can describe the space of modular forms via 
Mx, (T) = {fof € Ax(T) holomorphic <=> fof =0 or div(fof) > O}, 
which is isomorphic to the complex vector space 
{f € C(X(P)) : % =0 or div(f) + div(f) > 0} 


where we've used the fact that degree is a homomorphism. This is just the linear space of div(f), and it seems that 


we might want to use Riemann-Roch. But div(f) might have non-integer coefficients, so instead we'll just take the 


floor, |div(f)|, which is defined via 
> nyx| - So Ladx. 
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Since div(f9) has all integer coefficients (because fo is meromorphic), we actually have 
div(f) + div(f) > 0 <=> div(f) + |div(f)| > 0. 
This means that the dimension satisfies 
dim(Mx(T)) = &([div(F)J), 


and we'll now try to understand the right-hand side better. Let w be the meromorphic differential which pulls back 
to f(T)(dT)*/? on H: recall that {x2,;} denotes the period 2 elliptic points of X(F), of which there are a total of €9, 


{x3,;} and €3 are defined similarly for period 3, and {x;} and € denote the cusps. 


Definition 227 


We define the formal divisor 


div(dT) = SS 5, Se xa) ee 
i i i 


Here, w kind of looks like f - (dr)k/?, so we want to “expand out the div.” 


Proposition 228 
We do have 


div(w) = div(f) + Adiv(dr), 


Proof. We just compare coefficients, using the fact that 


vs(u) = (A) 5 (1-5) 


at non-cusps, and the same thing without the z at cusps. 


If we rearrange this result and take floors term by term (each x-coefficient only appears in one sum, and the first 


term div(w) has integers), 


[div(F)| = div(w) + >> Hee +S> B x3, 4 » xy 


i i 


We can now use Riemann-Roch on the floor-div: usually there is a canonical differential term which we want to avoid. 


Proposition 229 


The degree of |div(f)| is larger than 2g — 2. 


Proof. We know that the divisor is a homomorphism, so we can bash out the sum: we know that deg(div(w)) = 
k(g—1), so 


: k k k 
dea(Ldiv(t))) = kg 1) + | F] ea + |S] es + Sen 
and then we bound by the worst case based on k: 


—2 k—2 
4 


> K(2g 2)4 : 


This can be rewritten as 


where the parenthetical term is at least O from a previous lecture, and €. Is the equivalence classes of oo, so it is at 


least 1. This shows that the degree is large enough. 


And now applying Riemann-Roch in this simple case yields 
dim(Mx(T)) = &([div(f)]) = deg([div(f)}) — 9 +1, 


meaning that 


dim(M,(T)) = (k —1)(g — 1) 4 A E24 5 €34 ee 


We'll now talk briefly about cusp forms, but the story is basically the same: we just have that 


S(T) SL (awn _ >») 


The strategy is basically the same, except that we require divisors at cusps to have coefficients at least 1. This time, 
when we calculate the degree of |div(f)|, we lose the last €,, term in the boxed expression above, and we need k > 4 
so that the parenthetical term is now a strict “larger than 0.” (For the k = 2 term, [div(f)] —}>; x; is just the canonical 
divisor div(A), so the linear space has dimension g.) 

Finally, let's look at the k < 0 case: Mo(X(I)) must be constant by Liouville’s theorem, and So(X(T)) = {0} is 
the only cusp form. Notice that for k < 0, if there is any function f € M;,(L), then f!7A~* would be in So(¥), so 
indeed we also just have M,(T) = {0} for all k < 0 (which implies that the spaces of cusps forms are also zero). 


So all of our results can be summarized in a compact form: 


Theorem 230 


For even k, we have 


= Dip) 
dim(M,(T)) = 41 
0 


(k—1)(g—-1) 
dim(Mx(T)) = 4 g 


0 


This theorem Is nice because it gives us results in terms of congruence subgroups which we know how to calculate. 


And note that when T = SL>(Z), we already have a similar result which can be easily derived by the results we know. 


Example 231 


We can prove that 


Ma(lo(p)) = S2(To(p)) B CG2,p 


by showing that the dimensions add up. 


Basically, So is just missing a dimension of 1 from Mo, and note that CG2p = Go(T) — pGo(pT) is an element of 


Mo but not an element of So. 
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Example 232 


For all k even such that k(N +1) = 24, we have dim(S,(fo(N))) = 1. 


This in particular means that S,(Fo(N)) = Sx(Fi(N)) = Che, where 
k(T) = n(7)n(NT)* 
for the Dedekind eta function 7. In particular, this means that 
Mo(To(11)) = Cho 6 CG2p : 


we can find some very explicit descriptions of modular forms. 
Finally, we have the isomorphism 
Sx(SLo(Z)) = So(To(p)), 


where k = p+ 1 and pis an odd prime. And when p = 11, we have the simple isomorphism where we multiply by the 


; @: 
function . 


Diamond and Shurman 3.6 — Vanshika Jain 


We'll prove the dimension formulas for the odd-k case here. Here’s the result we're trying to show: 


Theorem 233 
Let [ be a congruence subgroup of SL5(Z), and let k be an odd integer. If —/ ET, then M,;(T) = S,(T) = 0. 
Otherwise, let g be the genus of X(I), €3 be the number of elliptic points of period 3, e%9 be the number of 


regular cusps and e"" be the number of irregular cusps. Then 


(1G) |e ley cee 5 ch kee 


dim M,(F) = he 
< 


1) 1) AS2 ered 4 Ant eitt > 3 
k<0 


dim S(T) = 


When k = 1, if ef9 > 2g — 2, then the dimensions of M,, S; are e€9/2 and 0 respectively, and otherwise, they 


are at least ef€9/2 and equal to ef9/2, respectively. 


We'll use the following tools: 
+ The fact that dim(M;,(T)) = £(|{div(f)]|) still holds for odd k. 


+ We have an isomorphism of vector spaces a,(T) > 2®"/2X(I)). In particular, we can take f € A;(F) and 


consider f2 for n = 2k. 


First, note that M,(I) = 0 for all k < 0, so S,(F) =0 for all k < 0 as well. And if k is odd and —/ €T, then 


f(r) = (FI-4k)(r) = -F (7), 


which implies that the whole space A,(7) = 0, which means that M;(T) = S,(P) = 0. 
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From here on out, & will be odd and positive, and we can assume —/ ¢T. Recall that X(T) can have both regular 
1 oA 
and irregular cusps: a irregular cusp 7(s) € X(I) is one where the group (a~/Ta) is generated by — E i where 


a(oo) = s. (The cusp is regular if there’s no negative sign.) Then we have that the order of vanishing is 


Vs(f)/2 irregular cusp for s 
Uns) (F) = : 
Us(F) otherwise 
for a cusp m(s) € X(T). (Basically, this is because the Fourier series width is 2h instead of just fh.) Also, recall 
that there is an isomorphism between automorphic forms and differentials when k € N is even (see above). We'll use 
that isomorphism now: take w to be the differential w(f*), which pulls back to f(7)?(d7)* on H. We'll also define 
{x3,:}, {xi}, {x/} to be the period 3 elliptic points, regular cusps, and irregular cusps, with orders €3, €©9, and el", 


We'll similarly define the formal divisor 
. 2 
div(dr) = : 33. > Xj > xe 


Note that because —/ ¢ I, there are no period 2 elliptic points, so this is consistent with last time’s definition. Replace 


f and k from the last lecture to show similarly that 
div(w) = 2div(f) + kdiv(dr), 
and now we can combine these two to find that 


. Des k k k 
div(f) = 5 div(w) > 373.4 :> 5X4 »; 5X 
i i i 


Taking pointwise floor, the order of vanishing is integral everywhere except at x3,;, xj, x/. For the elliptic points x of 


order 3, we know that we can write the order of vanishing as 


Jj 1 j—k 
w(AYamt+s = 5 Mx(w) = m+ gd 


but j = k mod 3 because $Vx(w) is integral, so the integral part of the order of vanishing is just 5 v.(w) + [4]. Similarly, 


we can find the integral parts at x; and x/: these have additional terms of $ and 45+ (plus the original 3vx(w)), so 


dim(M,()) = £(|dim(F)]) = ¢ (Jam Ds 5 x3; 4 2 = dy Ste), 


Our goal is to use a corollary of Riemann-Roch, so we want to get a bound on the degree of the divisor here (specifically, 


we again want it to be at least 2g — 2). But 


Eco 
2 


deg(|div(f)|) = k(g—1)4 5 €34 weft | ae ey 2) (9 14 


5 ) + (29 2) >2qg-2 


for all k > 3, so now the simpler form of Riemann-Roch tells us that 


: = \ k \ k reg | 7 irr 
ala F))) = (k= 1g 1) +] 5 fea + Sel + SAU, 
as desired. For cusp forms, we'll look at functions of the form fof € A,(T): note that at a regular cusp, 


Ve(fof) > 0 =} vx(fof) > 1 = vl) + vx(f) -1> 0, 
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and at an irregular cusp, ; 
Vx (fof) > 0 <= > vy(fof) > vx(fo) + v(F) — 5 > 0. 


(this is coming from the order of vanishing from earlier in the lecture). So we want to measure the dimension 


£ (issn -Sox-S) 31) ; 


and we just do similar calculations to get the result in our main theorem. 
To summarize, we used the isomorphism with f? instead of f, which gives us our differential. This allows us to 
calculate an explicit formula for the divisor div(f), and then we do some case-by-case analysis to get the integral parts. 
We'll now prove that A,(T) does contain a nonzero f when k > 0 is odd and —/ ¢ IT. Take a nonzero differential 
d € 1(X(P)) — for example, we can consider the divisor that pulls back to /’(T)dT. Now take an element xo € X(T) 
— by Riemann-Roch, the degree of the divisor of A is 2g — 2, so 


div(A) — 2(g — 1)x € Div’, 


where Div?® is the set of divisors of degree 0. (In general, Div’ denotes the divisors of nonzero meromorphic functions 
on X(T).) Abel's theorem tells us that we have a mapping Div’/Div’ > C9/Ag: let x + Ag be the image of the 
element div(A) — 2(g — 1)xp + Div’. If a divisor D € Div® has that D + Div’ maps to & +g (where z € C9), then we 
know that 

2D = div(A) + 2(g — 1)xo + div(f) 


for some f € C(X(T). We can rearrange this to 
div(fA) = 2(D + (g — 1)xo), 


and remember our goal now Is to find a automorphic form f which is weight-k and nonzero, and we'll do this by finding 
something of weight 1 and raise it to the kth power. Let f be the function such that f on X(0) pulls back to 


f(T)dT on H. This is nice because we know what dT is: we find that 


div(f) = div(fA) — div(dr) = 2(D + (g — 1)xo) 4 XSi t Sox t+ Sox. 


We know that at elliptic points of period 3, v;(f) is 3Vn(r) (F) and we have a 2 in the numerator for this term, so 


this is even. Since f is weight-2 invariant, there exists a function f, such that = f and fily]1 = x(y)fi, where 


x : F — {+1} (so it’s an automorphic form up to some negative sign). Let [’ be the set such that x(y) = 1. If the 
index is 1, then we're good — otherwise, we find a function fo which is negative at the correct spots, and multiplying 


f, by f2 means we have our element of A,(T), as desired. 


19 April 21, 2020 


Diamond and Shurman 3.8, 4.1 — Michelle Xu 


We'll cover cusps for congruence subgroups (explicit forms, cusps in relation to double cosets) and Eisenstein series 
(review of material from Serre) in this lecture: the topics might seem disjoint, but we'll connect them in the next 
lecture. 


Recall that a cusp is an equivalence class of QU {oo} under action by [: this is also denoted [\(QU {co}). We 
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know that there is always an element of SL2(Z) which maps oo to r for any rational r, so there is only one cusp for 


SL»5(Z). However, this won't be true with our congruence subgroups. 


Lemma 234 


Let s = 2 and s’ = & be elements of QU {oo} such that gcd(a, c) = gcd(a’, c’) = 1 are coprime. Then 


s' = (s) > E by Et 
Cc C 


Proof. When c, c’ are both nonzero, we know that if 7 = ° ‘ , 


/ (s) a  pat+qc_a 
Y co u6ratte) OU 


But both fractions are in lowest terms because we can invert y~!, so we must have a’ = pa+qc,c’ = rat+ tc as 
desired. 


If c =0, then a= 1 based on our condition, so z = — Here, we have gcd(p, r) = 1 because pt — qr = 1, which 


means that a’ = p,c’ =r. (And c’ = 0 is similar.) 


We'll now focus more on the congruence subgroups: 


Lemma 235 
If y € T(N), then 


Proof. Here, ¥ is the identity matrix mod WN, so left-to-right is clear. For the other direction, we start with the specific 
case a = 1,c = O, which means that a’ = 1 mod N. Bezout tells us that for any x, y € Z, we have m,n € Z such 
that mx + ny = gcd(x, y), and in general we can make mx + ny equal to any multiple of gcd(x, y). In our specific 


case, because a’ and c’ are coprime, we can find 6, 6 such that 


l-a 
/ fe 
aB+cé= yO 
and this means we can use the matrix 
a BN 
= ET(N). 
! y 1+6N ”) 


In general (when we don’t have (a, c) = (1,0)), we know that there exist b,d € Z such that ad — bc = 1, so using 


; a b 
the matrix @ = 
1 a _,/4 1 
a = => a = mod N, 
0 Cc Cc 0 
ce! c 


and use y = a7y/a~* because ['(V) is a normal subgroup of S/>(Z). 


so we can first find y’ such that 
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We'll use these lemmas to prove more about our cusps: 


Proposition 236 
Take the same notation as above, so s = = 


correspond to the same cusp: 


T(N)s' =T(N)s <> ° 
G 


Ee = ee ° 
iG 


To(N)s! =Po(N)s <> Ee 
G 


for some Jj, y. 


q! 
i] = ay 
Cc Cc 


For [3(N), any matrix can be written in the form 


1 * 
mod N = 


We decompose the congruence subgroup as 


a 


by the first lemma, and then we can apply the second lemma. 


cN dN+1 


ti 
‘ 


aN +1 ae 


r1(N) =U) 


and we can now apply the results we already know for [: the left hand side is equivalent to saying that s’ € [4(N)s, 


Lj 
so s’ €T(N) ‘i : s for some j. But then we can just multiply out and apply the lemmas for (NV), and this tells us 


gives us the desired result. 


For To, we'll decompose again: 


To) = r() 


xX k 
N yl. 


where we take the union over y relatively prime to N, and we need xy — Nk = 1. And the rest is a similar calculation: 


xXatke 


k 
the left side is equivalent to s’ € [1(N) : | x for some y, which is equivalent to [y(N)s’ = [1(N) 


applying the result for [4 tells us that 


_|xatket+sye 
yc 


mod WN, 


and multiply a factor of y on both sides to get the result. 


This means we have explicit conditions for checking whether two elements in QU {oo} are the same cusps, 


this allows us to count the number of cusps for each subgroup. 


Instead, we'll discuss these cusps in a group-theoretic manner: 
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Na+yc' 


and 


and 


Definition 237 
Let G be an arbitrary group with subgroups H;, H>. A double coset of G is a subset of the form H, gH», and the 


space of double cosets is denoted H,\G/Hb. 


These double cosets are disjoint, and we can therefore decompose as 


G =|) Migh. 
gER 


Proposition 238 
Let [ be a congruence subgroup of SL2(Z). Let P be the parabolic subgroup 


p-feft sea} 


a 
Then the map between ['\SL2(Z)/P and cusps of T is a bijection: F 
G 


b 
i P maps to ['(a/c). 


(We won't prove this, but it’s a nice way to study cusps and is more general.) 


We'll finish by reviewing the Eisenstein series: recall that we define these to be 


G(r) = y aExa 


(c,d)EZ 


where we sum over all pairs of integers except (0,0). We also define the normalized Eisenstein series F,(|tau) = 
Gx(T) 
2¢(k) ° 


We'll be rewriting the Eisenstein series in nicer forms now: 


Lemma 239 
We have 


Gdr=ckh) SD 


LN 
gcd(c,d)=1 (cr + d) 


Proof. Reorganize the order of summation by the gcd (absolute convergence allows us to do this for k > 4): 


1 1 
ee 3 Sy (cr + d)k - oa Ss (cr + dk 


n=1 gcd(c,d)=n gcd(c,d)=1 


which is what we want. 


Lemma 240 
Let P, be the positive part of the parabolic subgroup of SL2(Z) (P, except without the + 


Edr=5 itr) 


yEP+\SL2(Z) 


where j/(7, 7) = (cT +d). 
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Proof. Take the previous lemma and divide to find 


1 
a ee 
gcd(c,d)=1 (cr + d) 
We know that c,d have no common divisor, so we can index with SL2(Z) as long as we remove duplicates of the 
same (c,d). And this requires proving that the sets of SL2(Z) with the same (c, d) are exactly the same as the orbits 
of P,. One direction is easy — we can check that multiplying SL2(Z) by Py preserves the (c, d) pair. For the other 
‘ , . . a, by a bo 
direction, if we consider both and , then 
c d c d 

by, — bo)c 
ag ee : q 2) : 


and indeed n= Py be means we have a perfect indexing of (c, d) pairs. Substituting everything in gives the result. 


This in particular show that Ex(7) is weakly modular of weight k: this is because we act with some + and use 
properties of the factor of automorphy j(7, 7), as well as the fact that multiplying by -y’ onto some y € Pi\SL2(Z) 
gives another element of that equivalence class. 

We found in Serre that 

Mk = Sk BCGx, 


which motivates the following definition in general: 


Definition 241 


The weight-k Einstein space €,(1) of a congruence subgroup [ is the quotient space M,/Sx,. 


We will prove later that this is a subspace of M, and also complementary to Sx. 


Proposition 242 
The dimension of the Eisenstein space €,(T) is €4 when k > 4 is even, e&9 when k > 3 is odd and —/ ¢ T, 
Eoo — 1 when k = 2, e%3 when k = 1 and —/ ¢I, 1 when k =0, and 0 otherwise. 


This is basically just casework from previous lectures and subtracting dimensions. 


Diamond and Shurman 4.2 — Andrew Gu 


In this lecture, we will generalize the construction from last section to construct multiple Eisenstein series for congruence 
subgroups [, and we'll get an explicit basis for the Eisenstein space. Throughout this lecture, overline means reduction 
mod N, PY = P, MT (N), k > 3 isa positive integer, and V € (Z/nZ)? is a row vector of order N (in other words, 
gcd(c, d, N) is 1 for any lift v = (c,d) of V). Also, let ey be equal to $ for N = 1 or 2 and be equal to 1 otherwise 
(this is compensating for the —/ being in [(N)). 


Definition 243 


For any row vector V, define the Eisenstein series 


Eien oS) Ser edn”, 
(c,d)=v mod N 
gcd(c,d)=1 
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Proposition 244 


Let 6 be any element of SL5(Z) where the bottom row of the matrix reduces to (c,, dy) mod N. Then we can 


group matrices by bottom row: 


Maso ps = AGnor 


yEPN\T (NYS 


where j(¥, 7) is the usual factor of automorphy cr + d. 


We'll omit this proof for now. Our goal is to show that this series is an element of M,(F(N)): to do that, we 
need to show that it is holomorphic, weight-k invariant, and holomorphic at all cusps. The first property is basically 
identical to the N = 1 case (because of uniform convergence, the function is again holomorphic everywhere). 


To understand whether the transformation law works correctly, we consider the weight-k operator: 


Proposition 245 (Transformation law) 


For any y € SL5(Z), we have 


(Ela ae 


In other words, we just end up with another Eisenstein series corresponding to a different row vector. 


Proof. This is just a computation: recall that the weight-k operator is defined via 


(Flv]e)(7) = ily, T) “Ff (9(7)), 


so the left side is equal to 


ewaay* > Jee) 


yEPN\T(N)S 


which can be simplified via properties of / to 


=en >) id’yty*, 


EPN\T(N)S 


and relabeling indices, this is summing j(y", 7)~* over -y/ € PI\F(N)éy, which is exactly the right side. 


Corollary 246 


EY is weight-k invariant. 


Proof. For any y € T(N), VY = V (because ¥ is the identity mod N), so this follows directly from the above result: 


(Exile (7) = EP (7), 


as desired. 


It remains to show that the behavior at the cusps is what we want: we'll start by looking at the cusp oo. The 


only terms in the Eisenstein series that do not vanish are those where (ct + d)~* does not go to zero, so we must 


have c = 0 and |d| = 1, and this only occurs when v = +(0,1). N = 1 and 2 are special cases, because we have 
both (0,1) and (0,—1). When k is odd, these contributions cancel, and otherwise we get an extra contribution 
of $((—1)k + (—1)‘) = (—1)* from the two j(y,7) terms. Thus, as Im(T) —> oo, EZ(r) is only nonzero when 


V = +(0,1), and furthermore we must have either NV > 3 or k even. 
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Remark 247. This is part of why the normalization €y factor is nice: the values of our normalized Eisenstein series 


Ey are always integers at cusps. 


Now note that when k is odd and n € {1,2}, the Eisenstein space is zero (because —/ is in the congruence 


subgroup). So we'll assume that we're working with N > 3 and k even until we do our Fourier coefficient calculation. 


Proposition 248 


Ee is holomorphic at all cusps. 


/ / 


! b 
Proof. Let V = (c, d) be the bottom row of 6 € SL2(Z). If s = % is any cusp in QU {oo}, let a = y | € SLo(Z) 
Cc 


d' 


take co to s: then the transformation law in Proposition 245 tells us that 
Ex (s) = Exlalx(oo) = Ef (co), 


where the superscript can be written as (0, 1)da@. We know that this last Eisenstein series is only nonzero if 


a a’ —d 
01. => F = or mod WN. 
Cc Cc 


This means that E/ is only nonvanishing at the cusp (orbit) [(N)(—d/c) and is zero everywhere else. Thus the value 


TT 


(0, 1)da = 


is always finite, so we do have holomorphicity at the cusps. 


Note now that all cusps are regular, so €,(F(N)) has dimension €,,(F(N)) (since we're assuming k > 3 and N is 
not 1 or 2). Also notice above that each cusp can be associated with an explicit Eisenstein series which is only nonzero 


at that cusp, so we can use that set of Eisenstein series to create a basis. 


Corollary 249 
We can construct an Eisenstein series by summing over cosets: for any congruence subgroup [ of level NV, the 


sum over coset representatives 


gD Tadanh 


WET (N)\T 


is an element of M,(T). 


We'll now move on to computing Fourier series. Ey is normalized, but we want to instead look at the non-normalized 
series (where we don't require gcd(c, d) = 1) 


Uy 


Gi(r)= So (er+ay* 


(c,d)=v mod n 


when calculating these Fourier coefficients. We'll group our terms according to gcd(c, d), which is always coprime to 


N by assumption, and this yields 


Gir)= > SS) (er+d)*, 


n=1 (c,d)=v mod N 
gcd(n,N)=1 gcd(c,d)=N 


and now factoring out the gcd term in the inner sum yields 


= S- < S- (rad), 


n=1 (c',d')=n-!v mod N 
gcd(n,N)=1 gcd(c!,d')=1 
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Thus, we can write , 
= is 
Gr)=— SI OER (7), 
e n€(Z/nZ)* 
where 
1 
C2 (k) = S- me 
m=1 
m=n mod N 


is the modified zeta function for any n € (Z/nZ)*. (This reduces to the usual zeta function for N = 1.) We can 


similarly rewrite the E series in terms of the G series: 


Proposition 250 


We have 
B= Ye Ca wera 
né(Z/nZ)* 


for the function 


Cikw= Sp Bo 


mk 


m=1 
m=n mod N 


where f4(m) is 0 for non-squarefree m, and otherwise (—1)* if k prime numbers divide m. 


We prove this with a lot of calculation and a Mobius inversion: one main idea is that }7 4, 4(d) = 1 when m= 1 
and 0 otherwise. And now that recalling that we can choose a collection of vectors that represent each cusp once, 
giving a basis of {E}/}s: the result above now tells us that we can get a basis of {G/}s as well. 


To get a Fourier series, we'll need the following lemma which was proved in Serre earlier in this class: 


Lemma 251 

For any k > 2 and tT € H, we have 

So =C, y mk il Gu 
(INK 

= (T+ = 


(=2mi)* 
(kK-TT 


where g = e277 and C, = 


This now allows us to compute the Fourier series of G/ (remembering that we're summing over all nonzero (c, d)) 


Gy(7) = > Ue Ta eee 


etl 
c=cy deZ c=cy aez. ( aa aes 5F 


Notice that this does sum over all (c, d) = (c,, dy) mod N, and now we'll break this up by values of c. 


by rewriting as 


* c =0 yields the constant term of the Fourier series (the above lemma will show that all other values of c have 


no constant term, or alternatively we can think of taking T to +/oo): if ¢/ = 0, then this is 


54 
Doge 
d=d, 


and otherwise it is zero. 


+ For c > 0 (corresponding to points in the upper half plane), we can use our lemma with are in place of T. 
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This then evaluates to 


Ck ~ k-1,,dym.cm 
zy =a z= ye oe a, 
Se aez, ( +d)‘ c=ty m=1 
c>0 
where Ly = e27/N and g = e2/7/N. (The c and d, in the exponents can be understood if we expand out 


e2mi(cT+d)/N as in the lemma.) This can then be rewritten by summing over constant exponents of gy as 


= DY ot aieal, 
c=cy mln 


c>0 n/m=c 
m>0 


and we have a Fourier expansion for the c > 0 terms. 


* On the other hand, for c < 0, we can write cred td= ( cra d). The first term in the parentheses, 


ae now does exist in the upper half-plane, and applying the lemma gives something similar: we have 


1 1 Ed ra 
Wi 2 2a aga cay a EL 


c=cv dEZ c=cv m=1 
c<0 c>0 


and we can now rewrite this sum again by summing over the exponent of qy: if we flip signs to force c to now 


be positive, m must be negative, and the (—1)* term simplifies with the (—1)*~ from the m exponent. We end 


Cr k-1,, dy 
i aD Dae 
c= mln 


c>0 n/m=c 
m<O 


up with 


where notice that we now sum over the negative divisors of n instead of the positive ones. 


Putting all of these together, we get the full Fourier expansion for Gy: the qj)-coefficient of G/(T) for a point V 


of order nis 


where 
ov_1(n) = S> sgn(m)m* we. 


As a final note, this Fourier expansion tells us that the coefficients are polynomially-bounded for any a and thus 


we find directly that we have a modular form. 


20 April 21, 2020 


Diamond and Shurman 4.3 — Andrew Lin 


Recall that last week, we introduced the Eisenstein series associated to specific congruence subgroups [. We were 
able to describe Eisenstein series that live in M,(I), and we also found that we could describe the cusp “Eisenstein 
space” 

Ex(T) = Mx(0)/Sk(T), 


computing explicit basis elements of €x(T(N)). The point of this section is to start giving some background for looking 


at modular forms for [,(N), as well as cover some general number theory concepts. 
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Throughout this, we will let Gy denote the multiplicative group (Z/NZ)*: recall that this group has order #(N), 


and it consists of the residues mod WN that are relatively prime to N. 


Definition 252 


A Dirichlet character mod N is a homomorphism x : Gy > C*. 


Here are a few useful properties we can extract from this definition: 
+ Because Gy is a finite group, all elements have finite order, so x(7) is a root of unity for all n € Gy. 


¢ The set of Dirichlet characters form a group, called the dual group Gy of Gy. Indeed, the law of composition 
for two characters x, w is the product character (xwW)(n) = x(n)wW(n). This group has an identity element ly, 
which just sends everything to 1 (known as the trivial character), and the inverse of a character x is x-! = X, 


which takes the complex conjugate of x(n) for all n. 


« Recall that for prime numbers p, the multiplicative group (Z/pZ)* is cyclic. Then the dual group is also cyclic: 


each character is determined by which root of unity a primitive root g is sent to. 


This last result can be generalized to all integers, not just primes: 


Fact 253 


The dual group Gy is isomorphic to Gy (and in particular, there are ¢(N) Dirichlet characters mod N). 


The proof can be found in Serre: the main idea is to induct on the order of the group by considering subgroups of 
Gy and noting that for any subgroup H of G, we can decompose G as the product of H and G/H (and same for the 
dual groups). However, there is no canonical isomorphism in general (for example, we need to pick a primitive root 
arbitrarily for the prime n case). 


The next result generalizes the duality between the two groups: 


Proposition 254 (Orthogonality relations) 
We have 


neGy 0 otherwise 


all 


> xp = | 


eC 0 otherwise. 


These may look familiar (for example) from 18.702. 


Proof. We'll just prove the first result — the two proofs are basically the same. If x = 1), then we are just adding up 
the total number of elements in the group, which is clearly $(n) by the above fact. Otherwise, there is some element 
y such that x(y) #1. Then summing over the whole group, 


S> x(n) = S2 x(ny) = SO x(n)xly), 


né€Gn n€Gn né€Gn 


which means that 


(x(v) — DCS5 x(n) = 0, 


n€Gy 
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and the result follows. We do the same thing for the second relation, except we multiply by a character instead (and 


use the fact that we still have a group). 


We'll now say a bit more about this idea of building up characters from smaller ones: suppose that d divides N, 
and we have a Dirichlet character x%q mod d. Then we can construct a Dirichlet character mod N in the natural way 
by letting 

xn(n mod N) = xg(n mod d), 


but this is only always consistent if d divides N. We denote this via xv = X° Tyg, where 7y,q takes an element of 


Gy and reduces it mod d. 


Definition 255 


The conductor of a Dirichlet character x mod N is the smallest d|N such that we can write xX = Xq ° Tq for 


some Xq. 


Because of the simple group structure here, we can equivalently say that the conductor is the smallest d such that 


x is identically equal to 1 for all elements in the normal subgroup 
K={n€Gy:n=1 mod d}, 
which is the kernel of the projection map. 


Example 256 
The conductor of the Dirichlet character y mod 12 sending 1, 7 to +1 and 5,11 to —1 is 3. 


Definition 257 


A Dirichlet character mod WN is primitive if tts conductor is N. 


For example, w “only cares about the element mod 3,” so it is not primitive. 


We can extend the definition of x to all integers by turning it into a function x: Z—> C: 


x(n mod NN) gcd(n,N) =1 
x(n) = 
otherwise. 
This function is indeed consistent with the original x we described, and it is still multiplicative, although it is not a 
homomorphism because 0 is not invertible in C. (Keep in mind that this also means the trivial character 1, is no 
longer just equal to 1 everywhere, and for example x(0) is only nonzero when N = 1.) Then we can restate the 


orthogonality relations in a more helpful form for later use, though they are not really any different: 


P(N) x=1n 


N-1 
do x(n) = , >> x(n)= 


7=0 0 otherwise xeCy 


o(N) n=1modN 


0 otherwise. 
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Definition 258 


The Gauss sum of a character x mod N is 


N-1 
9(x) = >> x(m)uh, 
n=0 


where uy = e27'/" is the standard Nth root of unity. 


For example, the Gauss sum of wW above is zero, which does not tell us very much useful information. 


Proposition 259 


For any primitive character 7% mod WN and any integer m, 


N-1 
S> x(n) up” = x(n) g(x). 
n=0 


Notice the special cases when m = N and m= 1: the sum of x Is zero for a primitive character. 


Proof. When gcd(m, N) = 1, this is simple: multiply the left side by x(m)x(m) = 1 to get 


N-1 
x(m) $2 x(n)x(m) ua”, 
n=0 


and now the sum is still summing over all integers mod WN, so it is g(x), recovering the desired result. 

Otherwise, the right hand side is zero because X(m) = 0. Let d = gcd(m, N), and set m/d = mand Wd =N. 
Now rewrite the sum based on the residue of n mod N’: for any given residue n’ mod N’, the exponents for those 
corresponding terms look like aaa for some k. But d|m, so the kN’ terms disappear and we're just left with an 


exponent of n’m. Noting that (uy)? = wp, this leaves us with 


N-1 N’-1 N-1 
Soxtmun=S>~ So x(n) | war”. 
n=0 n’=0 n=0 


n=n' mod N’ 
To show the desired result, it suffices to show that the parenthetical term is zero. First of all, consider the sum over 


K (that is, when n’ = 1): since x is primitive, the value of x over this kernel should not be trivial, so by a similar 


“sum-over-group” argument, the sum over K is zero. And now we can sum over any other coset n’K and get zero by 


multiplicativity, and we're done. 


Corollary 260 


The Gauss sum of a primitive character has magnitude VW (in particular, it is nonzero). 


Proof. We know that 


N-1 
9(x)G(x) = 9(x) $5 X(m) wR”, 


m=0 
and bringing g(x) in and using the above proposition yields 
N-1N-1 N-1 N-1 
- x(n) aR” = 3 xm) SO wy”, 
m=0 n=0 n=0 m=0 
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where we've swapped the order of summation. But the sum over m (by a familiar argument) is zero unless (n — 1)m 


is a multiple of N, which happens exactly when n = 1. And that means the sum just reduces to Sa, 1 = NA, as 


desired. 


Proposition 261 


If N = 1 or 2, then all Dirichlet characters have x(—1) = 1. Otherwise, there are an even number of Dirichlet 


characters, and half of them have each of x(—1) = £1. 


Proof. \t's easy to write down the only Dirichlet character for N = 1 and 2: it’s the trivial one. Otherwise, x(—1) 
must either be —1 or 1 (because x(—1)? = x(1) = 1), and there does exist a character that takes —1 mod N to —1, 


because we can lift a nontrivial character either mod p for an odd prime p|N (sending a primitive root mod p to a 


primitive root of unity) or the nontrivial character mod 4 when 4|N (which sends +1 to +1). So we have a surjective 


homomorphism from the group of characters Gy to {+1}, and the correspondence theorem yields the result. 


This is most of what we'll need for the upcoming theory of Eisenstein spaces, and we'll finish with an important idea 


of decomposition: we'll construct subspaces of modular forms M,(T4(N)) corresponding to the congruence subgroup 


1 * 
of matrices y = ; ] mod N. 


Definition 262 


The x-eigenspace of a Dirichlet character x mod WN is 
Mi (N, x) ={f © Mg(Pi(N)) : Flylk = x(dy)f Vy € To(N)}, 


where dy is the lower-right entry of 7. 


This is clearly a (linear) subspace of M,(F1(N)). 


Lemma 263 
M;(N,1) = Mx(Po(N)), and M,(N, x) is trivial unless x(—1) = (-1)*. 


Proof. For the first statement, note that 1(d,) = 1 because gcd(d,, N) = 1, and the space of functions left invariant 
under [y]x for y € Fo(N) is exactly Fo(N). 

For the second statement, the statement is vacuously true for N = 2, and otherwise —/ is an element of Fo(N). 
Then f[y]x = (—1)‘f, and the result follows. 


Proposition 264 
The vector spaces M;(Pi(N)), Sx (Fi(N)), and €,(F1(N)) all decompose as direct sum of x-eigenspaces: for 


example, 


My(Ti(N)) = GMa (NX): 


x 


and analogously for Sx and Ex. 
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Proof. The proofs are the same for the first two spaces, and then quotienting yields the result for the third (since 
(A@® B)/(C 6 D) = A/C @ B/D for spaces C C A,D C B). We'll only present the first proof. Define the weight-k 


operator 
a b 
[le al) 


for any 6 = d mod N. This is a well-defined, multiplicative operator called a diamond operator, but we need more 


theory from Chapter 5 to see why this is indeed well-defined. Now for each character mod N, we can define 
1 = 
™ =a Dd, XA Md). 
G(N) de(Z/nZ)* 


This is a linear projection operator: applying 7, twice as a double sum and using multiplicativity includes x(d) for 
each element @(N) times. To understand where this projection operator sends our modular forms, if we take any 
f € Mx(Fi(N)), then 


1 -1 
MF LY] = FW Pa M0 (d) fly] x 


still sums over d because the lower-right element dy of [y] is invertible: 


= oH Sx dy)x(d)-2x( dy) 1d dy) F = [x(a mF I 
dé(Z/nZ)* 


so we do project into. M;(N, x) (1 satisfies the charactersitic condition), and when f is already in the x-eigenspace, 


Tx acts as 


1 4 _ 1 = = 
wel = Beqy x(a DF = Seay Xt) x(d)f =f, 


sO we project onto the subspace. 


To finish, note that the sum over characters 


1 -1 
2% = 2, GH) S> x(d)71(a) 


dé€(Z/nZ)* 


is zero except when d = 1, in which case we just end up with am o() - (1), which is the identity operator on [. 


This means the subspaces are spanning. Finally, for any two distinct characters x, x’, 


o _ 1 a -1 
ome = a, Do x(a (eet 


by multiplicativity, but summing over any constant c = de yields a sum 
> x(d)*x'(a)x(c), 
d 


which is zero since x~!x is not the identity. Thus we've shown that the subspaces are disjoint: nothing projects down 


to both subspaces, and we're done. 


Diamond and Shurman 4.4 — Natalie Stewart 


The basic structure of this lecture is to go through a laundry list of functions to define and use: we'll be discussing the 
gamma function [, zeta function ¢ (and associated €), L-functions L(s, x) and the modified zeta function ¢!'(s). The 


point is to provide meromorphic extensions to all of C for these functions, and characterize the poles and important 
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explicit values. 


Definition 265 


The gamma function 


r(s)= [ eae ade 
0 


This is an “extension of the factorial function:” we have that [(n+ 1) = n! for integers n, and in fact whenever 


t 


Re(s) > 0, the integral converges because we have exponential decay from e~*. The main point is that we have a 


functional equation for all Re(s) > 0 of the form 


(s+ N) 


USS 65) 0GnS 


which we prove via integration by parts of the identity [(s +1) = sl'(s) (we lower the power of the polynomial term). 
The point is that we can use this to extend T to a function on all of C, except that we avoid points where we have to 
divide by zero: 

r(stn+t+1) oe 1)" 
ns-(s+1)::*(stn—-1) al 
So when we meromorphically continue [ to the domain where Re(s) < 0, we will have a simple pole at each integer 
(= ti 


lim bs + Mi (s) = = lim 


. There’s also another alternative formula that we can use: 


—n € Z<o with a residue 


Proposition 266 
r(s)f(1—s)= 


T 
sin(7s) * 


In particular, this means that [(s) is nonvanishing on all of C (we just need to check explicitly at the positive 


integers where we have poles, where we already know the value of [), so * is an entire function. 


Proof sketch. Break down both sides into series: we use the fact that 


lim (1 — ) =e, sin(ws) =7s I ( = =) 


(the latter is the Euler product, which “tracks the zeros of the sine”). Then breaking down the integrals means that 


n t n 
I, = i (: _ ) te-ldt 
0 n 


converges to ['(s), and these are easier to integrate by parts because we are integrating polynomials. We find that 


we want to show 


1 
== atl 37 / me) 


In(s)In(—s) = 


and taking n — oo will give us what we want. 


If we define the constant C, = aia we can verify that T (5) = ,/m (change variables to a Gaussian integral), 


and in general we can compute at half-integers that 


m-G-W/PT((1 — k)/2) Cx 
mw k/2P(k/2) a 
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This is shown by directly manipulating products: 


Cy (—2mi)k _ ak ak (—1)k/2 
2 2W(k) = 2(1-3---(kK—1)(2-4---(k—2))' 


and then we distribute factors of —2 into the parentheses on the left and +2 into the parentheses on the right. 
(Everything works out — the point is to set up a parity between powers of 7m and values where [ is evaluated.) 


We'll now move on to another function: 


Definition 267 


The Riemann zeta function is defined as = 


(= 5 = 


n=1 


for any Re(s) > 1. 


The relevant complex analysis argument shows that this is indeed absolutely convergent on this half-plane, and we 


also have the product formula 


(s)= TJ a-psy7 


p prime 
(we can basically multiply each factor for a prime p over to the other side, and that accounts for all of the denominators 


that have prime powers of p). We'll define in section 4.9 that if we define the function 
E(s)=ar Tr (s/2)C(6), 


we have the functional equation €(s) = €(1 — s), and the only poles of this function are at s = 0,1. And this tells us 
that we have a meromorphic continuation of ¢ to the entire complex plane, such that the only pole is at s = 1 (with 
residue 1) and we have simple zeros at s = —2, —4, —6,--- (and possibly other points as well). 


The main argument we'll make about these is that we can use our coefficients C, from earlier to get all the negative 


integer values of ¢: recall that ¢(k) = Be where B, is the kth Bernoulli number, and that tells us that 
G—s)/2 ((1— s)/2 
T Ss 
(s) = Be Ei 8), 


m—s/2T(—s/2 


so €(1—k) = oe = Be 


We can also generalize this zeta function using the Dirichlet character: 


Definition 268 
For a Dirichlet character x mod WN, define the L-function 


co 


he = ys x(n) 


ns 
n=1 


for Re(s) > 1. 


A similar product argument shows that 


L(s,x) =[[Q-x(e)p*)y3, 


p 
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and this means that, for example, 


L(s,1y) = C(s) [J —p*) 


p|N 
(remember that ly is 1 whenever our integer is relatively prime to N and 0 otherwise). So at least for the identity 
character 1), we can meromorphically continue the L-function: L(s,1,) will have a meromorphic continuation to the 
entire complex plane, where there is only a simple pole at 1 with residue Hpi —p*)= oN) 
But whenever x is not the character 1, we actually have better behavior: we can find functional equations for 
non-principal Dirichlet characters, and the extensions will turn out to be entire. So the key point is that ¢ and L(s, x) 
are meromorphic with some nice properties. 


We'll quickly talk about the last family of functions here: 


Definition 269 
The modified zeta functions are defined via 


[o-e) / 


MO ee OS ae 


m=1 m=n mod N 
m=n mod N 


(where we're allowed to sum over negative m in the second sum). Also, define 


u(m) 


We care about these because they showed up in our equations between our Eisenstein series Ey and Gy, and in fact 
this gives us a relation between C? and C?(k, wu). We can write our modified zeta functions as a sum of L-functions: 
we have that 

any Do MPLS. x) = C28) 
xeGn 
by using the orthogonality relations, and therefore ¢! also has a meromorphic continuation to the complex plane, 
which again only has a simple pole at 1. This gives us the continuation of ¢!'(s, ) by defining linear relations between 
vectors of Ef and Gj and using a fair bit of linear algebra: the punchline is that we can write ¢"(s, 4) in terms of 


L-functions, so we can meromorphically continue it as well. Finally, 
e's) = C1 (s) + C1) cp"), 


so we can find a meromorphic extension, and in fact ¢” is entire (we just need to check that there isn’t a pole at 


s = 1). One value that we will care about later on is 


ny = MB cot 1 
C"(1) = 47 + 7, cot ar 
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Diamond and Shurman 4.5 — Nikhil Reddy 


Recall that we defined the (original) Eisenstein series G,(7) and Ex(7) by summing i ae over all integers (c, d) 


(except for zero), and we can also extract out a factor of ¢(k) by only considering coprime (c,d) to define the 


110 


normalized Eisenstein series. We checked that both of these converge absolutely when k > 3, and we'll assume that 
throughout this section. (Notice that G, and E, are both identically zero in this definition whenever k is odd, so these 
are really only defined for even k.) 


Our next idea was to define the alternate series 


_ 1 7 1 
ee So a Se ee 
(c,d)=v mod N cor - d) (c,d)=v mod N (cr 3 d) 
gcd(c,d)+1 


for a vector V of additive order N. (Remember that the overline means we reduce mod NV.) These play nicely with 
each other as well, and notice that taking N = 1 gives us the original Ex and Gx. 


We also proved that the G/s were a linear combination of the EVs, given by 
ae | <1 
Ge=— DT GQUWER (1), 
né€(Z/nZ)* 


where €1.(k) is the modified Riemann zeta function. We'll see a return of the Dirichlet characters as well: these are the 
homomorphisms from Gy — C*, and the important idea is that a character has a conductor (which is the periodicity 
as a function Z — C): a primitive character is one with condutor N. (And remember that overline over a character is 


a conjugate, not a reduction mod N.) Recall that when x is primitive, we have 


N-1 
So x(n) un” = x(m) g(x) 
n=0 
and therefore the Gauss sum is nonzero for a primitive character. This is helpful for us to to decompose the space 


Mu(Ti(N)) = GB Mu(N, x) 


x 


into x-elgenspaces: we're going to be studying those eigenspaces in this lecture here. 
We'll keep doing a bit more review so the calculations make sense: last week, we showed the weight-k operator 


satisfies 
(Exlyle)(7) = Ex"(7). 

Since the GYs are linear combinations of multiple E's, the same equation holds for them as well: 
(GklIk)(7) = Gf7(7). 


we'll use this throughout the lecture. 


Proposition 270 


The Eisenstein series eo for any d € (Z/nZ)* lives in Mx (Fi(N)). 


Proof. Any element y € [1(N) acts as the identity on vectors (0, d), so the above equation tells us that 


(Gilylk)(7) = G(T), 


which is the characteristic property for a modular form in M,(T1(N)). 


Our goal from here will be to construct a basis of the Eisenstein space by using pairs of primitive characters. Our 


first step will be to try to build an element of [9(N) from our elements Gon: 
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Proposition 271 


The Eisenstein series 


dé(Z/nZ)* 


is an element of Mx(To(N)). 


Proof. Notice that applying the operator [y], means we're just summing over the group in a different order (using 


dd, instead of d, so we have the same sum again when we apply that operator. 


Proposition 272 


The Eisenstein series 


SX x(aee* 


dé(Z/nZ)* 


is an element of the x-eigenspace M;,(N, x). 


Proof. We're still bringing the dy into the vector, so the result under the weight-k operator is 


Rae 
dé (Z/nZ)* 
Multiplying in a factor of x(d,)x(dy) = 1 gives us a x(dy) that comes out, and the rest is “summing over the group” 
again. Then we end up with 


x (dy) suMge (zany X(A)GOO™, 


which is the definition of the x-eigenspace. 


Now we can do the full construction as follows: start with two Dirichlet characters yw mod u and ¢ mod v, such 


that 
“uv=N, 
» The product character satisfies (wx)(—1) = (—1)*, and 
* @ is primitive. 
Then the definition we're using is 


u—1 v—1 u— 


GPT) = SSS v(e)o(Dalr (1). 


c=0 d=0 e=0 


This is secretly pretty familiar: if we plug in u = 1, the sums over c and e disappear, and we’re just left with the series 
we just worked with in the ¢-eigenspace in the proposition above. (And we define Gr to be zero when V is not of order 
N.) 


Let's prove this is in the ¢-elgenspace: notice that 


(cv,d+tev)y=(c'v,d’+ ev 


where we change the coordinates via c’ = cay, d’ = ddy, e’ = (e+ c’by) dy, so we might as well sum over c’, d’, e’ 
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instead. So the action under [y]« yields 


u-1 v—-1 u-1 


GE (Tle = 52 SS So wlcotanaler 24 (1). 


c’=0 d’=0 e’=0 


and now we just need to change the c and d in the Dirichlet characters: we use the fact that 


Y(c)O(d) = P(ay)W(c')O(dy)6(0), 
and becaus a,d, = 1, we can write w(a,) = W(d,), so this just means we end up with 
GP lk = (Wo)(dy)GP*. 


This gives us our first result: 


Theorem 273 


The Eisenstein series gee lives in Mx(N, W@). 


Remember that we have a Fourier series for Gi, where the non-constant part looks like 


Ck " 
ae > eon(mm” eg. 


mn>0,n=cy mod N 


We can plug this into the formula for our Eisenstein series Gre to get a quadruple sum 


u-1v—-lu-1 


v(c)6(d) YS > sgn(m)m tp Orem grin 


c=0 d=0 e=0 mn>0,n=cv 
which we can group by summing roots of unity, using that uv = N: 


u-1lv-1 


S-w(c)6(¢) SD sgn(m)m*tygr (Sas) an” 
d=0 


c=0 mn>0,n=cy e=0 


But the inner sum of roots of unity is zero unless u divides m, and also nis a multiple of v by definition. So we can 


get rid of one of the summations by replacing m — um and n—- vn to get 


C u-1 v-1 _ 
= 48S usd) — sani) tusng™, 
c=0 d=0 mn>0,n=c mod u 


and now because @ is primitive by assumption, we can bring the ¢(d) into the inner sum and use our proposition about 


Gauss sums: now we have 


gOS we ) 2 sgn(m)g(m)ymk*q™, 


mn>0,n=c(u) 


and now the sum c can be summed over all n instead to get 


=e) », S- sgn(m)y(n)o(m)m*- 1 g™”, 


mn>0 mn>0 


and now the negativity condition tells us that the (m,n) and (—m, —n) terms are identical: we end up with 


= AHO S~ y(nyo(m)mi-tgr”, 


m,n>0 
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and we can rewrite this in a way that looks like the o,_1 coefficients in the original Eisenstein series: 
Cx9(b) _ 
= ADS [> (n/m)? |g”. 
n=1 \m|n,m>0 


We can then derive the following formula: 


Theorem 274 


We can rewrite the Eisenstein series as 


Cxg(b) -v, 
Gee = vk Be 


Ef? = 5(P)L(1— k,8) +2) of 4(n)q", 


n=1 


where 6(w) is 1 when the character is identically 1 and 0 otherwise, and we have the generalized power sum 


ops = > W(n/m)o(m)m‘* 


m|n,m>0 


So now we'll move to constructing the basis elements: 


Definition 275 
Let Ay., be the set of triples (w, 6, t) such that w,@ are primitive mod u and v with (wd)(—1) = (—1)‘, and 


t is an integer such that tuv|N. Let By, be the set of pairs (w’,x’) such that w’ and x’ are (not necessarily 
primitive) characters modulo wu’, v’ with u’v’ = N and (q'¢')(—1) = (-1)*. 


These sets are in bijection, because we can send 
(w, d, t) > ae Pw/tu) 


and 
(Wd) = (yg, u'/u) 


where w, @ are the primitive characters mod wu’ and v’ (the v’/v is implied because we can determine it from the value 
of N). This is important because By,x is easier to work with — we don’t need to worry about primitive characters — at 


first-order, the size of By, Is 


1 
5 > MAGNA), 


d|N 
and this seems to be also the size of €,([1(N)). This turns out to be true for all N > 4, and thus it makes sense that 


we can use the set Ay. x to make a basis (using our Ge functions): 


Theorem 276 
Let N bea positive integer and k > 3. Define Be) = EOE) then the set Be form a basis of Ex(T1(N). 


In particular, the set of ENG! with wd = x also give us a basis for the x-eigenspace €,(N, x). 


In particular, choosing w = 1 also gives us a basis of eigenspaces for I. 
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Diamond and Shurman 4.6 — Christian Altamirano 


In this section, we'll talk about the Eisenstein spaces for k = 2. We'll begin by constructing a basis for E2([(N)): the 


Weierstrass go function 


1 : 1 d 
@(z) = 4 yy (z—w) we 
weA 


helps us define the function (where we use the lattice Z + Zr) 


= 1 c(t + dy 
fy — N2 Or ( N ) 


which is a weakly modular function of weight 2 with respect to F(N). We'll try and expand that now: this yields 


/ 


il 1 1 1 
fy = ! 
2 (yt + dy)? © N? on (tte — og — d)° (ct + dj? 


Our goal is to compute the Fourier coefficients here. One formula that we'll use is the Fourier series of the following 


sum: 


Proposition 277 
For any T € H and k > 2, we have 


Bie =a eo ae 


deZ m=1 
and similarly if —T € H, we have 


1 =a 
area = =p SS Gamera a 


deZ m=—oo 


We'll use the same notation as in section 4.2, since there will be a lot of computation. 


Theorem 278 
We have ; 
fY (1) = G(r) — 25 G2(), 


where G(T) is the regular Eisenstein series of weight 2 with Fourier expansion 2¢(2) + 2C2 °°, a(n)q” and 


= Gye 
G3 (7) = 5(ev)6"(2) + 5 > OF (m) aN 
n= 


Proof. We'll split the sum for OT) into three parts: c = 0,c > 0, and c < 0. When c = 0, we'll also add the 


. Then we end up with a sum 


1 il ae | 
N2 a ( cyT+dy :) N2 >, d2’ 
deZ ( ~ d) 


N deZ 


1 
constant term tatap 


The right sum is just We 2¢(2), and the left sum depends on cy: if c, = 0, we just get ¢%(2), and otherwise the 


imaginary part of cyT + dy Is positive (because we only sum cy, from 0 to N), so we can use the Fourier expansion in 
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the proposition above using cyt + dy instead of 7. This yields 


1 co 
qe? yn)”, 
m=1 


2mi(cvT+dy 


where g’ =e )/N_ Then qd’ is simply related to g, so we can plug that in: this means that c = 0 contribution 


is just 


5(or)C*(2) + (1 — Slew) 53 Do wh” aff” — 52¢(2). 


For the c > O case, we want to figure out 


iL 
N2 p> (ot bd (ct +d)? 


2 
c>0 gez. ( re = d) 


We can apply the proposition for the second term again because Im(cT) > 0, and we end up with 
1 /\m 
N2 S- Com(q)”, 
c>0 


2QMiCT 


where q’ =e = q°. Reordering the sum by summing over mc instead, this becomes 


For the first term of this sum, we have a negative imaginary part, so we expand out the expressions there as well. The 


eventual result is that the c > 0 contribution looks like 


Co Co 
a D2 san(m)ui” | ah — a Do a(n)a”. 
n=1 ae n=1 
n/m=c,(N) 


And the c < 0 case is similar — the right sum is the same and we need to be careful about the negative sign for Im(r). 
Summing everything together, we do end up with the result that we wanted — for instance, things cancel out between 
c=Oandc <0. 


We'll now move on to the “correction terms” for these series: remember that the Eisenstein series Go(T) is only 


invariant when we add the correction term — So something similar occurs here when we correct our G> expression, 


may 
because 


f(T) = G3(T) — 362") 
is weight-2 invariant. 


Definition 279 


Define the weight-2 modular form 


g3(T) = G3(T) — N2 Im(7) = (7) 


To show holomorphicity everywhere, we can instead use the alternative method of proving the coefficients to be 


polynomially bounded: 


116 


Theorem 280 


The coefficients of g¥ are bounded by Cn7?. 


Proof. We can make an easy bound: all coefficients just look like 


loz(n)| < s Jsan(m)mpay"| < SO [ml < Cr? 


m|n,n/m=cy mod N m<n 


So we do indeed have a modular form in our above definition. 

With this, we'll move on to calculating a basis for E2(T(N)) and €2([1(N)): remember that the dimension of k = 2 
IS Egg — 1, SO we can get a basis by taking the cusp representatives and taking differences to remove one basis element. 
It turns out that €([(N)) has basis 


Veoo — Veo 1 Vi Eco } 


{oy , aD) 


We can also change these differences so that we use linear combinations over GY instead of g¥, as long as the 


corrections match up: 


Theorem 281 
The space €2(F(N)) can also be written as 


For €2(Fi(N)), we'll define the Eisenstein series Gi" (7) and EV?) analogously to how we did in the previous 
lecture for GP (r) and EV? (sr), 
Whenever w or @ is nontrivial here, Gres coefficients sum to zero, and also note that Gy € Ma(N,w¢) and 


that 
G(r) = a C29) ev egy, 


But when w, @ are both trivial, we won't get a modular form because of the correction term. But we can still use that 


case to get a modular form: 


Lemma 282 


For an integer t, the function 


OE. (n= tes lr )) = Goal 7) = Gal F)— tGa( tr) 


is a modular form. 


Proof. \t suffices to show that 
CoEP™ (7) = Go(7) 


for any T, and this is true because plugging in the trivial characters into the definition of E> yields 


i(-1, n)+2) (a? =C(- 1) +25 o(n)q", 


= n=1 


but G(T) is also of this form (we just need to check that 2¢(2) = C,¢(—1) so that the constants match up). 
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This means that we can finally define a basis for €2([4(N)): much like the last section, let Aj be the set of triples 
(w, @, t) such that w, @ are primitive characters mod u and v with (wd)(—1) = 1, and let t be an integer such that 
1 < tuv|N. (We don't want t = u = v = 1 here, so we're also excluding the tuple corresponding to the correction in 


the above lemma). 


Definition 283 
For any (w,¢, t) € Ano, define 


Fivticr) — tbh (tr =gd=1 
EX Ot (7) = 2 (T) 2 ( ) p=¢ 1 


ES? (er) otherwise. 


And this set turns out to be the basis for €2(T1(N)): in fact, the subsets 


{EY%" : (,, t) € Anz, 0b = x} 


will form bases for the x-eigenspaces E>(N, x). 


22 May 5, 2020 


Diamond and Shurman 4.7 — Michael Tang 


We'll be discussing the Bernoulli numbers and Hurwitz zeta function — this is mostly preparation for the next lecture. 
This will give a few formulas that are important about weight-1 Eisenstein series. 

As a roadmpa, the first part of this talk will be about Bernoulli numbers (derived from the context of power sums) 
Bernoulli polynomials, and the Bernoulli numbers of a Dirichlet character. Then in the second part, we will do some 
sketchy complex analysis to generalize both ¢ and the modified ¢ function from previous lectures, and we'll relate them 


to Bernoulli numbers with a formula. 


Definition 284 


The kth power sum is defined as 


Qa 
Sk(n) = So mk = OK 41k +--+ (0-1), 
m=0 


where we define 0° = 1 and 0% = 0 otherwise. 


We're zero-indexing here so that computation later will be a bit more convenient. 


Definition 285 


The generating function for the kth power sum is the exponential generating function 


0° k 
SGaie YS 


k=0 


(Here, we should think of t as our variable and n as a parameter. ) 
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Lemma 286 
We have 


Proof. We have 


ee) tk co n-l tk 
S(n, t)= 5° Sx(m) = = S(n, t) = Ie > me 
k=0 k=0 m=0 


and swapping the order of summation turns this into 


and now the inner sum is the Taylor series for e’"'. Evaluating the subsequent geometric series yields the result. 


We'll split this fraction up as 
et—1 ¢ 
t et—-1 
This is because both functions here have a limit as we approach 0, and we can write both of these more easily. 


S(n, t) = 


Definition 287 


The Bernoulli numbers are the coefficients of the exponential generating function 


Substituting this in, 


and now we substitute in the Taylor series for e”*, reparameterize the exponent of t after simplification, and we end 


up with 


love) k 
1 k+1 -\ tk 
B, k+1-j ae 
ee ; ) a k\ 

k=0 Jj=0 


But remember that S(n, t) is a generating function, so the coefficients must match up: this expression involving 


Bernoulli numbers must give us the power sum Sx(1n). We'll take a part of this expression: 


Definition 288 


The Bernoulli polynomials are defined as 


B,(X) = x (‘) BX, 


(Notation-wise, By, denotes the number, and Bx(X) denotes the polynomial.) So we can rewrite 


k 
1 (a. eee 1 

5 = —— : Bas, BY xX\=——l(e —B 
k(n) pi F ) in K(X) aa x+1(7) — Br+1) 
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(where we need to subtract off the last term of the sum in the definition of the Bernoulli polynomial because of the 


limits). 


Example 289 


We can look at the case k = 2. 


Then we know that Bp = 1, By = —5, Bo = z, and B3 = 0 by looking at the Taylor expansion of x. so 
3 2 4 ye, tl 
B3(X) — B3 = BoX” + 3B, X* + 3Bo(X) = xX” - 5X + 5* 


Substituting in X = n+ 1 means that we have a formula for the sum of the first n squares — this naturally generalizes 


to finding a formula for the sum of kth powers. 


Definition 290 
Let w : Z/uZ be a function (which doesn't need to be a Dirichlet character, but that’s the case we care about). 


The Bernoulli numbers of w, By y, are the constants such that 


lee) tk val tect 
Dd. Baw = dV) Sea: 
k=0 c=0 


This may seem a bit unintuitive, but it has nice properties. If we plug in the trivial character x = 1, then By. y = Bx 


t 


because the right hand side simplifies to =. 


And in fact, we can solve for By. y in terms of the Bx: 


Bay = oS 9(e8s (£). 
c=0 


This is proved by swapping the order of summation again (plug in the generating function for the Bernoulli polynomials 
Bx(c/u) on the right-hand side and do some more algebra). We'll need the k = 1 case in the next lecture, which says 
that 


u-1 a 1 
Biaij= y ———). 
Ly wc) (; 5) 
c=0 
Let's move on: 


Definition 291 
Let r € (0, 1]. The Hurwitz zeta function is defined for all Re(s) > 1 by 


ee 
s.n= So. 
ae Dien 


This converges absolutely for Re(s) > 1 by the same argument as for ¢. Notice that when we plug in r = 1, this 
yields the usual Riemann zeta function (notice that the Hurwitz zeta function starts the sum from 0 instead of 1), 


and when we plug in a rational number r = 4, we have ¢(s, r) = N°C4(s) (both of these can be directly checked). 
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Theorem 292 
Let w #1 be a Dirichlet character mod u. Then the L-series 


ae B 
L(1—k,w) eye le = - 
n=) 


In other words, we will be able to evaluate the L-function for Dirichlet characters at non-positive integers. 


Proof sketch. First, we can relate the L-function to the Hurwitz function: 
u Cc u 
Yo (o)¢ (1 =k, S) =u wee =k) = ULL = kW), 
c=1 c=1 


where we've used the formula for ¢(s, r). We'll work more with the left hand side here: lots of complex analysis gives 


us the analytic continuation 
r(1—s) ZO 


eZ —1 Zz! 


C(s.F)= 


271 7 


for all s € C, where y, (for € € R) is a contour which goes from —co to —€, does a counterclockwise circle, and goes 
back to —oo. Showing this requires the use of the Mellin transform, which we'll talk about later on. 
Substituting this into our above formula yields 


weia 6) = -O f Yom LPe Se. 
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Then if we apply the Cauchy integral formula to the integral around the circle, we indeed end up with the desired result 


(the integral along the line segments in y,- cancel out). 


So we now have a formula for our L-series on the nonnegative integers, and again we'll care about the k = 1 case: 
this reduces to 
L(0, py) = —By yp. 


What's interesting here is that the series L(0, w) is technically supposed to diverge, but we assign a value using this 


analytic continuation. 


Diamond and Shurman 4.8 — Zack Chroman 


The goal of this section is to continue to find concrete bases, but this time for the weight-1 eigenspaces €,(N, x) and 
Ex(Ti(N)). We'll mimic the k > 3 case as much as possible, but this is the most ugly case. Our approach will be to 
define a function that’s almost the Weierstrass go function, which will help us define a weight-1 modular form g,. This 


will then extend to a function G; which will allow us to define the basis elements Gi? and E@:?. 


Definition 293 


The Weierstrass zeta function is defined as 


Z\(Z) = 
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The proof that this converges Is similar to the proof that the Weierstrass function converges, and it turns out that 


we have 
Z4(z) = -@(z). 
This function will not be doubly periodic like @ is, but its derivative is doubly periodic (as already established), so we 


get the lattice constants 
mA) = Z2.(Z +41) — Za(Z),  m(A) = Za(Z + we) — Za(Z) 
(because their derivative is 0 — this is the same argument as usual). 


Lemma 294 


Let W1,W2 generate our lattice A. Then we have 


N2(A)wi — m(A)we = 201 


WwW 
when ae € HH. 


Proof. The proof mimics the way we understand the Weierstrass go function: we have a fundamental parallelogram 


with boundary OP, and we can translate this by t so that there are no poles on the boundary. The residue formula 


tells us that 
| Zn(z)dz = 2n71 
t+aP 


because the residues are 1, but we can also break this up into parts as 


[ee t+ t)— Za(z+w2 + t))dz4 Za(z+t)+ Za(Z+w1 4 t))dz 
0 0 


(one term for each side of the parallelogram), and this is exactly n2(A)w1 — 71(A)wo. 


We're going to black box some facts: it turns out that when A is generated by 1 and 7, we have 


ieee e2Miz oo e2nizq" e72nizq" 
Zn,(Z) = N2(Ar)Z — TI Teams = ~ 27 S- € a amie | 
n=1 


where q = €27'7. We also have that 72(A,) = G,), and therefore ;(A;) = TGo(T) — 277/ by the previous lemma. 
Now we can define our functions gi: let A be generated by w and ws. Fix an integer N and a vector (cy, dy) € 
(Z/NZ)*, and we define 


F(A) = Zn (4 5a) cym(A) —_ 


We can check that the contributions cancel out if we add N to c, or dy, so this is well-defined. When we specialize 


to the case where A is generated by 1 and T, we also define 


7 1 CT +d cym (Ar) — d272(A,z) 
ofr) = Zn, (SF) oO 


Lemma 295 


g, is weight-1 invariant. 


We'll also black box this fact. Our short-term goal with this is to plug in the formula for Z,,(Z) into this formula 
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2nit/N 2a i/N 


CyT+dy 
Ny as we 


here for g, to extract more information. We'll define z = q= eT q=e , and Uy =e 
have in the past few lectures. 
We'll go step-by-step from here when we plug in the complicated formula for Z,,. It turns out that the first and 


last terms cancel out nicely: 


N2(Ar)(Z) cym (Az) — dvn2(Ar) os 2T1Cy 

N N?2 N2 
We'll now deal with the : 
wil = e2tlz 
~N = e2TizZ 


term. In the case where c, = 0, then we just have z = dy, and this simplifies to q cot 2% 


Ee Otherwise, we'll expand 


as a geometric series to get 


ee e2tlz 
; Spiiz =—14 >> e2tim(evt+dv)/N 142 » qn He 
ae m=1 
meaning we can just write 
wil + e272 T Tdy 


N1— e272 a a) a cot N + (1— 6(c¢,)) (-H+§ ee ae qa ac dym } 


where 6(c,) is the indicator function for Cy being zero, and C; = —2m7/ (to draw an analogy with the previous cases). 


Finally, we need to deal with the other sums. First, we deal with 


[oe) _ ; n 

e 2Qmizq 
>» i e-2mizq" 
n=1 


We can expand out the geometric series, replace g with Ce and replace n with (cyt + d,)/N, and we rearrange in the 


same way as the last sum to get to 


co [oe) 
: . eg nee 


n=1 m=1 


The gy term is the important part, so we can reparameterize as 


=> utah 


k=1 <0 
elk 
k/L=cy 
where k = nmN — mc, and £= —m. 
Now, we need to deal with 
Le 
e2tizq” 
pS e2mizq"' 
n=1 


and we get to the same kind of intermediate statement 


co [oe) 
5 5 e2timd, /N Gyen 


n=1 m=1 


But this time, we overcount some extra terms, so we need to correct that: it only happens for the n = 0 term, which 
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doesn't appear in our original sum. This yields 


=10 Do ent ah | — (2 d(ev)) D0 ni an”. 
k=1 m>0 m=1 
mlk 
k/m=cy 


And now we can write out all of our terms together: putting together the 6(c,) terms between all of our calculations 


will make things simplify further, and we end up with 


= 2QTICy de Cine k > Ci f[/q 1 
G(T) = am + (cv) (D+ yd, c8(0 ah = do eearrad Ort cect 
where i 
sy] S) aye 
m|k,k/m=cy 
and 


G(r) = ace (1) + (>: cit | 
k=1 


So we've now defined g; and G;, and because the Fourier coefficients for g; are bounded, this is actually a modular 
form with respect to F(N). 
From here, we can define the series with respect to Dirichlet characters as well: if uv = N, w is a primitive 


character mod u, and ¢ is a character mod v, we can define 


u-—1v—-lu-1 


GPF (r) = S SPY Wea gh (7) 


c=0 d=0 e=0 


and we'll define 


EP ?(r) = 6(b)L(0, ¥) + 6(W)L(0, 6) +25 of 9(n)q”. 


n=1 


We're claiming that these objects are actually the same up to a constant: 


Gye = awe Eve 


where g(@) is the Gauss sum. In fact, we will establish a result similar to that of previous classes: 


Theorem 296 
Let A, be the set of triples ({¢, w}, t) (unordered pairs here) such that tuv|N, ~é(—1) = —1, and yw, ¢ are 
both primitive (mod u and v, respectively). Define EY" (7) = EY®(tr). Then feeehe : ({w, bd}, t) © Ani} 


form a basis for the Eisenstein space of [,(N), and for any character x, 


{EP?* : ({b,, t) € Ans. (Ov) = x} 


forms a basis for the x-eigenspace. 


Proof. Everything except the constant terms actually work out exactly like our earlier proofs, because G; and EF are 
the same as the k > 3 definitions except for the constants. But finding this constant is not very easy. We'll break 


down the constant term for > g; into the sums for Gz (which behaves like G, for larger k) and (g; — G1). Summing 
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over c,d, e means we can simplify the (gi — G,) term to 


C u-1 c 1 v-1 u-1 
1 = 
-SY wo (#-5) Vay. 
c=0 d=0 e=0 
But now the inner two sums are zero unless we have ¢ being the trivial character 1, in which case v = 1 and this all 
simplifies to 
N-1 = > 
= vw 1\ Gg) _ Cig() 
= -C8(0) 2 HO) (F— 3) = SPB w = SPPaOLO.W 


by using properties from Michael's lecture. 

The idea here is that when ¢ = 1, we can multiply by things like g(@) and v which are actually just 1, and then we 
can simplify to a form that we want. 

On the other hand, the triple sum constant term has already been computed: it's going to be 


v-1 


(0) 5 G(d)CA(A). 


d=0 
This is similarly zero except when w is the 1 character, and we'll write this as a limit as s approaches 1 of 


Vv 


 H(a%(S)=S> ST Hoy = (1 (-1))L (5,8) = YL. 8). 
d= d n=d mod v 


Oo 


Note that we have a functional equation for the L-function, so L(1,@) can be written in terms of L(0, 6) (in general, 


we can write L(k, f) in terms of L(1 — k, #)). So we end up with a total constant term 


Cig(b 
V 


((Y)L(0, w) + d(H) L(0, o)). 


And this is exactly the constant scaling that we wanted when we defined EY’. 
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Diamond and Shurman 4.9 — Dhruv Rohatgi 


We'll be discussing the Fourier and Mellin transforms, and there won't really be very much number theory in this 
lecture. The idea is that we've claimed a lot of holomorphic and meromorphic functions can be analytically continued 
— we proved this using a functional equation for the Gamma function, but we didn't really do it rigorously for the zeta 


function. So we'll be discussing the € function, defined as 
e(s)w 1 (s/2)c(s), 


and we'll show that € has a meromorphic continuation to all of C, which will show that ¢ does as well. 


Recall that the (é-dimension) theta function 
a(7,0) = So entre 
neZzé 


is holomorphic, because we’re bounded away from the real axis for any compact subset of C. We'll use a Poisson 


summation to prove a functional equation, which will allow us to construct and understand €. 
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Theorem 297 


For all t > 0 and integer 2 > 0, we have some symmetry along the imaginary axis: 


OG?) = Se ee ee a ae): 


neZet neZt 


We'll get a functional equation 
6(-1/T, £) = (—i7)*/76(7, £) 


as a corollary of this result. 


In order to prove this functional equation, we'll need to introduce the Fourier transform: 


Definition 298 


The Fourier transform of an absolutely integrable f : R* — C is defined to be 


fy) = f Foyer ay, 
Ré 


Proposition 299 


For “sufficiently nice” f ; R® + C, we have 


So SE A: 


neZt meZt 


Lemma 300 
The Fourier transform of a Gaussian is another Gaussian with a scaling factor: if f(x) = eT!’ is a function 
from Ré > R, then 

f(y) = po/2 eam? /t. 


Proof. This is a “contour shifting” proof: 


f(y) -| en Ttlx? Q-2mi(x.y)dx = eae f en ttlxtiy/tl? dy. 
Ré Ré 


But integrating over a shifted axis is the same as the regular integral of a Gaussian (by Cauchy’s theorem), and we 


can evaluate it to get the desired result. 


Applying this lemma and the Poisson summation formula gives us exactly what we want to prove the above theorem. 


Definition 301 
Let f : Rt > C be a function. Then the Mellin transform of f is 


gs) — ie F(t\r dt, 


This is nice because it has an inverse (and other nice properties), but we mostly care about it for motivation. 
Notably, if we pick f(t) = 5(@(it) — 1), then 


g(s) = 5 foe 0 aires 
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Lemma 302 
When Re(s) > 1/2, g(s) is well-defined, and 


g(s) = Ww °T(s)¢(2s) = €(2s). 


Proof. We check that we don't have “problem points” at t > 0 (since 6(/t) might explode) or t > oo (since tS~! 
might explode). Break up the integral into a part from 0 to 1 and a part from 1 to oo. The latter decays exponentially 


because |@(it) — 1] < 25072, e-™"t and to understand the former, use the functional equation to find that 
(it) = t-/76(i/t), 


so 6(i/t) goes to 1 as t > 0, meaning 6(it) is asymptotically t~!/. This means that when we multiply it by tS~! for 
any Re(s) > 1/2, this integral does indeed converge. 


To show that this is related to the € function, we can write out the definition of the theta function: 


1 f= co 00 
(s)=5 f (lit) aye tde= [ewes tat, 
: 0 n=1 


where we've summed over positive integers instead of all integers and removed the factor of 2. We can then use the 


dominated convergence theorem to swap the sum and integral to get 


oo lo) 
= ae err te de, 
n=1 0 


Substituting u = mnt, the integral becomes independent of n, and we end up with 


co 


= remy f e "us" du, 
n=1 0 


which miraculously simplifies to ~*¢(2s)I'(s), as desired. 


So we've shown that we have another expression for € on the same domain it’s defined, and now we can show that 


g extends meromorphically to C — this is easier than working with € directly. We again split into its two pieces: 


g(s) = [eu = edt [eis _ edt. 


The second integral is already holomorphic, so we don’t need to worry about it. The first integral is essentially going 


to look asymptotically like 
1 1 


i 
a —1/2 s-1 a os 
| (t 1)t dt iWa=y - 


and notice that the function on the right side is indeed meromorphic everywhere on C. So we can analyze in terms of 


an “error term:” whenever Re(s) > 5, we can first integrate the polynomial term 


1 1 
| (@(it) — yetae= [ a(it)ts tdt— = 
0 0 s 


and then do a u-substitution t = i to get 


g(s) = [ oc (as ee oe - _ wo 
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But this extra term is meromorphic, and that’s the same thing as saying that g (and therefore € meromorphically 
continue). And notice that there's a symmetry in this expression: g(s) = g($—s), and since g(s) = @(2s), this 
means that €(s) = €(1 — s). So we've shown that € extends meromorphically to all of C, with only simple poles at 0 
and 1, with a simple functional equation. This allows us to understand the poles of ¢ pretty clearly as well (it’s just at 
z= 1). 


Diamond and Shurman 4.10 — Swapnil Garg 


In this lecture, we'll meromorphically extend the Eisenstein series using the Mellin transform we just introduced. The 
idea here is that in Serre, we discussed the Eisenstein series for SL2(Z) for all even weight k > 4, and similarly we do 
this for k > 3 for congruence subgroups (where the weight doesn’t necessarily have to be even). Eisenstein series also 
exist for k = 1,2, but they're harder to deal with because they don’t converge. There's a lot more bashing that we 
have to do, but we'll look at these series directly by adding a complex parameter s. We will then analytically continue 


our series to s = 0, which is theoretically what our series should look like. 


Definition 303 
Let V be a vector in (Z/NZ)? of order N, let k be a positive integer, and €y be defined as usual (5 for N = 1,2 


and 1 otherwise). Let v be a lift of V to Z?. Then for any tT = (x + iy) € H and s € C, define the augmented 


series via 
Im(7)* 


(a7 deer d= 


Els) en y 
(c,d)=v mod N 
gcd(c,d)=1 


Note that whenever Re(k + 2s) > 2, this series converges absolutely, and because it converges uniformly on 
compact sets, this makes the series analytic on that particular half-plane. 


We're going to rewrite this series using weight-k operators: we'll generalize them to two parameters by letting 


(flalx)(7. 5) =i(. 7) “F(y(7), 5), 


where j(y, 7) is still ct + d. We showed earlier on that some functions f satisfy f[y]x = f, meaning that we are 


weakly modular of weight k. So if we let 6 be an element of SL>(Z) with bottom row (cy, dy), and we let P, be the 
lon 

positive part of the parabolic subgroup of SL2(Z) (meaning we consider the elements ; i where n Is a positive 

integer), we have the following result: 


Proposition 304 


The Eisenstein series can be rewritten as 


ELS) = ey Se Im(7)°[y]« 


YE(PLAT(N))\T(N)6 


(where we sum over the orbits of Py NT(N) inside [(N)6). 


7 b 
Proof. Compare this expression to the original definition of E/(7, 5). First, we can check that for any y = ? ; in 
Cc 
SL2(Z), we have 
Im(7) 


Im(a(r)) = a 
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so that means the Im(7)* and |cr + d|?5 cancel out the relevant factors. Looking at the stabilizer of [(N)6, two 
matrices in [(N)6 will have the same bottom row if and only if they are in the same orbit of P, MI (N), because [(N)6 
only contains matrices with bottom row (c,d) = v. This means that we know what the weight-k operator does: 


Im(7T)°* 
(er a)*ler ar 


Im(7)*Fy]x = (eT +d)" Im(¥(7))° = 


and plugging this back in and looking at what we sum over (each orbit corresponds to a specific vector (c, d)) yields 


the result. 


This yields the following formula, which should look familiar: 


Lemma 305 
We have 


(Exly|«)(7. 5) = Ex*(7, 5). 


Proof. This is the same as the proof in Section 4.2: we just sum over the orbits in a different order and gain an extra 


factor. 


In particular, for any y € ['(N), it is true that (EY[y]x)(7, s) = E(t, 5) (because Vy = V). TO proceed, we can 


look at the non-normalized Eisenstein series 


Gi{r,s)= S- uly 


ee (ct + d)k|cr + dl?s’ 


Remember that G and E are basically the same — there are some linear combinations — so if we can meromorphically 
continue one, we can meromorphically continue the other. So we'll only look at the case k = 0 and N = 1 today, 
which is very nice, and we'll see the general case next time. 


So now we can look at the modified theta function 


Oy) = DU erin 


neZ?2 


where y € GL2(R). 


Lemma 306 
For a function f € L4(IR?), a matrix y € SLo(R), and r > 0, the Fourier transform of a general function (for 


x € R?) o(x) = F(xyr) is O(x) = 2 F(xy~"/r), where y~7 denotes the negative transpose for +. 


We're skipping over this proof for now — it’s just a fact about Fourier transforms. So now if we use Poisson 


summation and use the function f(x) = e7™*’, we find that 


ry fir) ar? try 7 /P). 


neZ? neZ2 


0 -1 
for any r > 0 and y € SL2(R). So now if we let S = ~ a |" then Sy~? = 7S, and |x| and |xS| are the same. 


Since f is a Gaussian function, this means f(x) = f(xS), and S is just a 90 degree rotation of our lattice, so we sum 


over the same thing. Thus we get the following result: 
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Corollary 307 
We have the transformation law by using the fact that 0(7) = Yyj,eq2 eO7"’. 


ro(yr) = — 


Now we look at something similar to in Dhruv’s lecture: note that the Mellin transform of 8(yt!/2) — 1 


g(s.7) = [ (a(t? — ys =f So evtinnty 


néZ? 


(note that letting -y = // makes this the same as the previous lecture), where subtracting the 1 in the first expression 
removes the point at the origin (which is why we have a )~’). The transformation law tells us that &(yt!/2) must be 
asymptotically proportional to + near t = 0, because our function goes to 1 at infinity. That means that our integral 
for g(s,) converges at t = 0 as long as Re(s) > 1. But this function also converges at all s for t + oo (again, 
by exponential suppression). So we can move the sum outside the integral because of convergence, and changing 


variables with t replacing a|ny|?t yields our gamma function again: 
=p 07 = 
gs.) = (elm? ae fre eat TG) 3 |ny|-7*. 
neZ2 neZ2 


So now we can connect this to the function Go(T, s): for any T = x + /y in the upper half-plane, define 


—_ lily x 
Yr Vv lo 1 , 
which sends / to T. Since |(c, d)yr| = |ct + d|*/y (just by computation), we find that 


9(S,¥r) = 1 °T(s) 5 


peer |CT - “ape 


which is exactly the expression m°I(s)Go(T, s) for the augmented Eisenstein series when k = 0 and N = 1. A bit of 
algebra analogous to the last section (a change of variables), plus the transformation law, yields 


1 
1-s 


[eae net = [Poor -vert 


But notice that this integrand converges quickly — there are no problem points — which shows that g(s,y) only has 


simple poles at s = 0 and 1, and we have g(s, y) = g(1—s,7). So 
mw 5T(s)Go(t, s) = > 41 (1 — s)Go(7, 1-5), 


and we've defined our weight-O Eisenstein series (in a simple case) with a meromorphic continuation. This function is 


SL>2(Z)-invariant and of weight 0, so it is weakly modular. 
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Diamond and Shurman 4.10 continued — Shreyas Balaji 


We'll start with some definitions from last lecture: we have 


(7) = Do ely € GL2(R), 
neZ? 


1 ly x : 
_=— € SLo(R), T=x+Iy, 
7 ak i 2(R) y 


Gi(iris)= S- x 


k 25° 
c,d=v mod N (cr ay d) ler + d| : 


Last time, we worked with these objects by considering the function @(yt1/2) — 1, and we considered the Mellin 
transform g(s,), which was both equal to 1~*P(s)Go(7, 5) and an meromorphic function in s 


1 
1l-—s 


asin) =f (o(rtt?) — ie + FS 


which has a meromorphic extension to the full complex plane. Our goal today is extend this logic to higher weights 
and levels k, N. 
We'll introduce some notation: let G be (Z/NZ)* (we'll be treating these group elements as row vectors), let 


un = e2™/N and we'll be considering arbitrary functions a : G — C instead of vectors (they don’t need to be 
0 -1 
homomorphisms). We'll let (-,-) denote the standard inner product, and we'll also define S = : 4 } Note that 


S$’ =~—S, which is nice for inner products which we'll be working with throughout this lecture. 


Definition 308 


The Fourier transform of a function a: G > C Is given by 


Proposition 309 


We can invert the Fourier transform to get 


aoe! oe 
ao) — ry S avy 
VEG 


; . eS er i 
Proof. There is some basis vector e; such that ue a # 1 as long as U is nonzero, and then we can “sum over the 


group” in two ways by the transformation v + v +e, showing that the sum )0 jeg no will be O for all 7 4 0. (And 


when U7 = 0, that sum is N?.) So we can plug in the definition for 4(V) and swap the order of summation, which will 


yield the result. 
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Definition 310 


The harmonic polynomial for a positive integer k, denoted h,, is defined as 
hee.) = (=) (era) 


for (c,d) € R?. 


Definition 311 
Define the theta function 6¥ : GL2(R) > C to be 


BL(y) = D> he((v/N + nye MONA 
neZ2 


For notation’s sake, let f(x) = e~7*’ be the Gaussian function and f(x) = h«(x)e7™*? be the Schwartz function. 


There's lots of useful facts we'll need before doing some algebraic manipulation: 
* The Fourier transform of the Schwartz function f is (—i)‘ f. 
* Define ¢x(x) = f(xyr) for some r > 0: then the Fourier transform is (—/)*r~?(xy~"r7?). 


+ Expanding out the definition of hy and defining z(x) = c + id for x = (c,d), we can write 
hy (x) = (z(xS)*. 


* This means that hy(xS) = (—i)Khy(x), and &(xS) = (—i)* hg (x). 
* Finally, Sy~’ =yS and S’ = -—S. 


So now we have a lot of algebra to get through: we expand out r8¥(yr) from first definitions and write it in terms 


of the Schwarz function and then ¢, (by definition) to get 
=r So f((v/N + n)yr) =r S > bk(v/N +n). 
n€ ZZ? neZ2 
Recall the Poisson summation formula: applying it to our function (where we use the Fourier transform) yields 
=r S- b(n) e2tiir-v/N) = (=i S- flney Tp ets), 
neZ2 ne€Z2 
Substituting n > nS and swapping around a few terms yields 
= (-i)kr So (=i) (ay ag”? 
neZ? 
and from here, we break up our sum by group elements in G: 
- -1),,—(w,vS 
=(-1)'r? 5° S- Alay), 
WEG n€Z?2,n=w mod N 
and now this Is exactly the sum that we want in the definition of our theta function: some more algebra yields 


= (1) S- OU (yNr)uyerr?. 
WEG 
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If we now look at aa as a function of V, then our last sum looks like N times the Fourier transform. Adding that factor 
back in yields 
r6X (or) = (-1)‘Nr 387). 


And since r is arbitrary, we send r > N1/2r, and we get the final identity 


POV (yn?) = (—I)kr 7 87(qn¥? 4) | 


Proposition 312 


Let a: G > C be a function. Then the sum-theta function 


Ox(Y) = S (av + (—1)ka(—v) ol vy?) 


VEG 


satisfies rO2(yr) = r-'O2(qr-*). 


To show this, we'll need a lemma that relates the Fourier transforms with swapping v > —Vv: 


Lemma 313 


For any function a: G + C, we have 


SY awporly) = Yo a1) 89), 


VEG veG 


5 A(-V) OX (7) = YF a(V) 8K (7). 


VEG VEG 


Proof of lemma. To do this, we can prove the general statement 


Yo a(x) b(x) = YO a(-y)B(y) 


xE€G yEeG 


by writing out the Fourier transforms and swapping the order of summation. Then substituting in b= or or oY yields 


the desired result. 


Proof of Proposition 312. Write out the definition of sum-theta function, apply the boxed identity above, and then 


use the lemma to move factors of —1 between the a functions. 


This is a generalization of what was shown in the last lecture. Note that O2(yr) rapidly converges to 0 as r + 00, 


so the following Mellin transform of ©7, denoted 


a " a 1/2 gat 
9 (S,Y) = one jt “et 
0 
converges as t — 0 for all s because of the proposition which essentially allows us to “swap” t?/2 with t71/2. 
More specifically, the integral converges as t + 0, as well as t — oo. So we can expand the sum, swapping the 
sum and the integral, expanding out the expression for the theta function, and then factor out all the terms that we 


can that don't depend on t, using the trick where we sum over all 7 = v mod N instead of all n € Z*. The final result 
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we find is that 


/ 


ge(s.y) =m */?-SNST(k/2 +) ¥-(a(v) + (-1)*a(—v)) SO ig (ry) ny. 


VEG n=v mod N 


Now defining y = 77 yields hyx(ny,) = (eT + d)k/y*/?, and we also have that |my|~*~25 = y*/2+s /|cr + d|k+?5, so we 


can substitute this in and simplify to find that the Mellin transform of the sum-theta function (specifying to y,) Is 
g2(s, Yr) = w*/?-S7(k/2 + 5) NSy*/?G2(r, 5 — k/2), 
where G7 is the sum of Eisenstein series given by 


Ge(r,s) = )_ (av) + (-1)ka(—-V) GE (YN"?). 
VEG 


And now we do the same thing with analytic continuation: applying our above proposition on the Mellin transform 


integral lets us relate values of ©? between the regions [0, 1] and [1, oo], and the transformation yields 


; dt = dt 
i; Spy?) -| O72 (ytl/?)(t8 oy ae re 
0 1 


which has symmetry under s + (1—s). And this last integral is entire in s, so combining everything gives us the final 


theorem of the section: 


Theorem 314 
For any positive integer N, let G = (Z/NZ)*. Construct G?(r, 5) as before: then for any integer k and any point 


T=x+iy EH, 
G/N) -Tk|/2 4 s)Gp(7,s — k/2) 


has a meromorphic continuation in s to all of C which is entire when k 4 0 and has simple poles at 0,1 for k = 0. 


(Specifically, we've proved the k > 0 case, and the others follow by a similar argument.) 


Diamond and Shurman 4.11 — Anton Trygub 


We'll make this presentation as short as possible to leave some time for celebration at the end of the class. Basically, 


we'll introduce a new theta function and show a few important properties. 


Lemma 315 


Let d be a cubefree positive integer. The equation x? = d mod p has 3 solutions if p = 1 mod 3 and d isa 


nonzero cube mod p, 0 solutions if p= 1 mod 3 and d is not a cube mod p, and 1 if p= 2 mod 3 or p|3d. 


Proof. The result is clear for p = 3 or p = d (because 03, 13, 23 are all different mod 3, and d = 0 is the only solution 


in the latter case). otherwise, let g be a primitive root mod p: letting d = p*, the equation reduces to showing the 


number of solutions to 3x = k mod (p— 1), and the result follows. 


We'll denote e(z) = e277 and tr(x) = x + x* throughout the next few results. We'll also use the notation 
A = Z[p3],0 = iV3,B = <A (note that AC BC $A as lattices). Some useful facts to remember are that if we 


write X = xX; + Xo for integers x1, Xo, 
[x]? = xP — xix. +x$, 0 [x t yl? = |x? + ly? + tr(xy’*). 
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Definition 316 
Let N be a positive integer, and let U € 3A/NA. Then we have the theta function 


6"(r, N) =De(m lq +f 7) 


neA 


for 7 € HL. 


Our first result tells us about this function when we move along the real axis: 


Lemma 317 
We have 


e(r +1, ny=e( ils ) er N) 


for any positive integer N and v7 € B/NA. 


Proof. We have that 


u |u|? 
N| a +n Sy mod Z 


by expanding out u = uy + Upp and n= ny + Mow. Since e(a+ b) = e(a)e(b) by definition, replacing 7 with 7 + 1 in 


the definition will give us an extra e (wv | aoe n| ie and this simplifies using our first observation because e Is 1-periodic 


in the real direction. 


The next result is a “scaling:” 


Lemma 318 
We have 
CE Gas) ae Se cla clit) 


VeEB/(dNA) 
V=n mod NA 


for any positive integer N, U € B/NA, and positive integer d. 


Proof. Let n= r-+dm, and substitute into the definition: 


_ 2 
(rn) = Dre (Mx +A] r)= > “(wv 
i? oh 


Ea 3. - 
GN m| dt 


where we've introduced a factor of i inside the absolute value square but canceled this out with the two factors of d 


outside. But now the sum over m can be rewritten as a theta function as well, which yields what we want. 


Finally, we describe how @ transforms under the “action” of S: 


Lemma 319 
We have 


oa) x oS) etm 


VEB/NA 


for any positive integer N and u € B/NA. 


This result takes a bit more effort to prove, so we'll leave it as an exercise. 
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